Overview
Dataset statistics
| Number of variables | 149 |
|---|---|
| Number of observations | 1926393 |
| Missing cells | 160886057 |
| Missing cells (%) | 56.1% |
| Total size in memory | 2.1 GiB |
| Average record size in memory | 1.2 KiB |
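The missing-cell percentage above can be cross-checked from the other figures in the same table. The following is a minimal sketch, assuming the percentage is simply missing cells divided by variables times observations; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 149
n_observations = 1_926_393
missing_cells = 160_886_057

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells:,} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result agrees with the 56.1% shown in the table.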
Variable types
| Text | 149 |
|---|---|
Dataset
| Description | Invertebrate Zoology NMNH Extant Specimen Records 0052489-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.fya67r |
Alerts
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "IZ" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
materialSampleID has constant value "NORTH_AMERICA" | Constant |
eventID has constant value "North Pacific Ocean, Gulf Of California" | Constant |
samplingEffort has constant value "24.1667" | Constant |
fieldNotes has constant value "-110.283" | Constant |
georeferencedDate has constant value "8" | Constant |
latestEonOrHighestEonothem has constant value "US" | Constant |
earliestEraOrLowestErathem has constant value "Idaho" | Constant |
earliestAgeOrLowestStage has constant value "NORTH_AMERICA" | Constant |
latestAgeOrHighestStage has constant value "North Pacific Ocean, Departure Bay" | Constant |
bed has constant value "Moultrie" | Constant |
identificationRemarks has constant value "-83.7685" | Constant |
acceptedNameUsage has constant value "SPECIES" | Constant |
parentNameUsage has constant value "GEOLocate" | Constant |
namePublishedIn has constant value "ACCEPTED" | Constant |
subgenus has constant value "false" | Constant |
cultivarEpithet has constant value "108" | Constant |
protocol has constant value "EML" | Constant |
relativeOrganismQuantity has constant value "821cc27a-e3bb-4bc5-ac34-89ada245069d" | Constant |
recordNumber has 1804640 (93.7%) missing values | Missing |
recordedBy has 764111 (39.7%) missing values | Missing |
sex has 1802980 (93.6%) missing values | Missing |
lifeStage has 1888856 (98.1%) missing values | Missing |
disposition has 1926391 (> 99.9%) missing values | Missing |
associatedOccurrences has 1926391 (> 99.9%) missing values | Missing |
associatedReferences has 1926391 (> 99.9%) missing values | Missing |
associatedSequences has 1921269 (99.7%) missing values | Missing |
associatedTaxa has 1926391 (> 99.9%) missing values | Missing |
occurrenceRemarks has 1144485 (59.4%) missing values | Missing |
verbatimLabel has 1926391 (> 99.9%) missing values | Missing |
materialSampleID has 1926391 (> 99.9%) missing values | Missing |
eventID has 1926392 (> 99.9%) missing values | Missing |
fieldNumber has 1339759 (69.5%) missing values | Missing |
eventDate has 688611 (35.7%) missing values | Missing |
startDayOfYear has 842313 (43.7%) missing values | Missing |
endDayOfYear has 842311 (43.7%) missing values | Missing |
year has 689273 (35.8%) missing values | Missing |
month has 800939 (41.6%) missing values | Missing |
day has 887053 (46.0%) missing values | Missing |
verbatimEventDate has 1173199 (60.9%) missing values | Missing |
habitat has 1857136 (96.4%) missing values | Missing |
samplingEffort has 1926392 (> 99.9%) missing values | Missing |
fieldNotes has 1926392 (> 99.9%) missing values | Missing |
locationID has 984066 (51.1%) missing values | Missing |
higherGeography has 67831 (3.5%) missing values | Missing |
continent has 1027391 (53.3%) missing values | Missing |
waterBody has 666651 (34.6%) missing values | Missing |
islandGroup has 1925623 (> 99.9%) missing values | Missing |
island has 1925415 (99.9%) missing values | Missing |
countryCode has 110759 (5.7%) missing values | Missing |
stateProvince has 943673 (49.0%) missing values | Missing |
county has 1786420 (92.7%) missing values | Missing |
locality has 642386 (33.3%) missing values | Missing |
verbatimElevation has 1925931 (> 99.9%) missing values | Missing |
verbatimDepth has 1900149 (98.6%) missing values | Missing |
decimalLatitude has 927346 (48.1%) missing values | Missing |
decimalLongitude has 927346 (48.1%) missing values | Missing |
verbatimCoordinateSystem has 1246885 (64.7%) missing values | Missing |
verbatimSRS has 1926391 (> 99.9%) missing values | Missing |
footprintSRS has 1926391 (> 99.9%) missing values | Missing |
footprintSpatialFit has 1926391 (> 99.9%) missing values | Missing |
georeferencedBy has 1926391 (> 99.9%) missing values | Missing |
georeferencedDate has 1926391 (> 99.9%) missing values | Missing |
georeferenceProtocol has 1265790 (65.7%) missing values | Missing |
georeferenceSources has 1926390 (> 99.9%) missing values | Missing |
georeferenceRemarks has 1896105 (98.4%) missing values | Missing |
latestEonOrHighestEonothem has 1926392 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 1926392 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 1926391 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 1926390 (> 99.9%) missing values | Missing |
earliestAgeOrLowestStage has 1926390 (> 99.9%) missing values | Missing |
latestAgeOrHighestStage has 1926392 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 1926388 (> 99.9%) missing values | Missing |
group has 1926391 (> 99.9%) missing values | Missing |
bed has 1926392 (> 99.9%) missing values | Missing |
identificationQualifier has 1908260 (99.1%) missing values | Missing |
typeStatus has 1841066 (95.6%) missing values | Missing |
identifiedBy has 1085208 (56.3%) missing values | Missing |
identifiedByID has 1926391 (> 99.9%) missing values | Missing |
dateIdentified has 1926391 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 1926390 (> 99.9%) missing values | Missing |
identificationRemarks has 1926392 (> 99.9%) missing values | Missing |
parentNameUsageID has 1926391 (> 99.9%) missing values | Missing |
namePublishedInID has 1926391 (> 99.9%) missing values | Missing |
acceptedNameUsage has 1926391 (> 99.9%) missing values | Missing |
parentNameUsage has 1926392 (> 99.9%) missing values | Missing |
namePublishedIn has 1926391 (> 99.9%) missing values | Missing |
class has 66157 (3.4%) missing values | Missing |
order has 329537 (17.1%) missing values | Missing |
family has 144488 (7.5%) missing values | Missing |
subtribe has 1926391 (> 99.9%) missing values | Missing |
genus has 358044 (18.6%) missing values | Missing |
genericName has 358043 (18.6%) missing values | Missing |
subgenus has 1926391 (> 99.9%) missing values | Missing |
infragenericEpithet has 1926391 (> 99.9%) missing values | Missing |
specificEpithet has 626798 (32.5%) missing values | Missing |
infraspecificEpithet has 1890289 (98.1%) missing values | Missing |
cultivarEpithet has 1926391 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 1926391 (> 99.9%) missing values | Missing |
vernacularName has 1926391 (> 99.9%) missing values | Missing |
nomenclaturalCode has 1926389 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 1926391 (> 99.9%) missing values | Missing |
taxonRemarks has 1926390 (> 99.9%) missing values | Missing |
elevation has 1919570 (99.6%) missing values | Missing |
elevationAccuracy has 1922885 (99.8%) missing values | Missing |
depth has 1143682 (59.4%) missing values | Missing |
depthAccuracy has 1205339 (62.6%) missing values | Missing |
distanceFromCentroidInMeters has 1917545 (99.5%) missing values | Missing |
mediaType has 1683241 (87.4%) missing values | Missing |
classKey has 66158 (3.4%) missing values | Missing |
orderKey has 329533 (17.1%) missing values | Missing |
familyKey has 144485 (7.5%) missing values | Missing |
genusKey has 358041 (18.6%) missing values | Missing |
subgenusKey has 1926388 (> 99.9%) missing values | Missing |
speciesKey has 626819 (32.5%) missing values | Missing |
species has 626822 (32.5%) missing values | Missing |
verbatimScientificName has 353775 (18.4%) missing values | Missing |
repatriated has 110144 (5.7%) missing values | Missing |
relativeOrganismQuantity has 1926392 (> 99.9%) missing values | Missing |
projectId has 1926390 (> 99.9%) missing values | Missing |
gbifRegion has 115678 (6.0%) missing values | Missing |
level0Gid has 1691070 (87.8%) missing values | Missing |
level0Name has 1691070 (87.8%) missing values | Missing |
level1Gid has 1694638 (88.0%) missing values | Missing |
level1Name has 1694634 (88.0%) missing values | Missing |
level2Gid has 1708984 (88.7%) missing values | Missing |
level2Name has 1709049 (88.7%) missing values | Missing |
level3Gid has 1886622 (97.9%) missing values | Missing |
level3Name has 1887342 (98.0%) missing values | Missing |
iucnRedListCategory has 469562 (24.4%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
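The percentages in the Missing alerts follow from each count and the 1926393 observations. The sketch below reproduces the displayed strings, including the "> 99.9%" form, under the assumption that the report rounds to one decimal place and avoids printing 100.0% when some values are still present. `format_missing` is a hypothetical helper, not a function of the profiling tool.

```python
def format_missing(missing: int, total: int) -> str:
    """Format a missing-value share the way the alert lines appear to."""
    pct = 100 * missing / total
    rounded = f"{pct:.1f}"
    # Assumed rule: never show 100.0% unless every value is missing.
    if rounded == "100.0" and missing < total:
        return "> 99.9%"
    # Assumed rule: never show 0.0% unless nothing is missing.
    if rounded == "0.0" and missing > 0:
        return "< 0.1%"
    return f"{rounded}%"


TOTAL = 1_926_393  # number of observations from the dataset statistics
for name, missing in [
    ("recordNumber", 1_804_640),
    ("island", 1_925_415),
    ("islandGroup", 1_925_623),
]:
    print(name, format_missing(missing, TOTAL))
```

The three sample columns were picked because they sit on either side of the rounding boundary: island stays at 99.9%, while islandGroup is shown as "> 99.9%".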
Reproduction
| Analysis started | 2025-01-08 22:50:57.098131 |
|---|---|
| Analysis finished | 2025-01-08 22:52:33.162435 |
| Duration | 1 minute and 36.06 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
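The duration row can be verified from the two timestamps. This is a small sketch using only the standard library; the format string is an assumption that matches the timestamps as printed above.

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-01-08 22:50:57.098131", FMT)
finished = datetime.strptime("2025-01-08 22:52:33.162435", FMT)

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```

The elapsed time works out to about 96.06 seconds, consistent with the reported 1 minute and 36.06 seconds.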
Variables
gbifID
Text
Unique 
| Distinct | 1926393 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1926393 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1321728981 |
|---|---|
| 2nd row | 1320179422 |
| 3rd row | 1320179575 |
| 4th row | 1321729723 |
| 5th row | 1320179846 |
| Value | Count | Frequency (%) |
| 1321728981 | 1 | < 0.1% |
| 2565454742 | 1 | < 0.1% |
| 1320179846 | 1 | < 0.1% |
| 1321730497 | 1 | < 0.1% |
| 1320180949 | 1 | < 0.1% |
| 1320181165 | 1 | < 0.1% |
| 1456364805 | 1 | < 0.1% |
| 1320182209 | 1 | < 0.1% |
| 1321732097 | 1 | < 0.1% |
| 2571470239 | 1 | < 0.1% |
| Other values (1926383) | 1926383 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3941642 | 20.5% |
| 3 | 2930195 | 15.2% |
| 2 | 2443917 | 12.7% |
| 7 | 1519890 | 7.9% |
| 8 | 1483841 | 7.7% |
| 0 | 1476009 | 7.7% |
| 9 | 1469022 | 7.6% |
| 5 | 1371397 | 7.1% |
| 6 | 1317118 | 6.8% |
| 4 | 1310899 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19263930 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3941642 | 20.5% |
| 3 | 2930195 | 15.2% |
| 2 | 2443917 | 12.7% |
| 7 | 1519890 | 7.9% |
| 8 | 1483841 | 7.7% |
| 0 | 1476009 | 7.7% |
| 9 | 1469022 | 7.6% |
| 5 | 1371397 | 7.1% |
| 6 | 1317118 | 6.8% |
| 4 | 1310899 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19263930 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3941642 | 20.5% |
| 3 | 2930195 | 15.2% |
| 2 | 2443917 | 12.7% |
| 7 | 1519890 | 7.9% |
| 8 | 1483841 | 7.7% |
| 0 | 1476009 | 7.7% |
| 9 | 1469022 | 7.6% |
| 5 | 1371397 | 7.1% |
| 6 | 1317118 | 6.8% |
| 4 | 1310899 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19263930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3941642 | 20.5% |
| 3 | 2930195 | 15.2% |
| 2 | 2443917 | 12.7% |
| 7 | 1519890 | 7.9% |
| 8 | 1483841 | 7.7% |
| 0 | 1476009 | 7.7% |
| 9 | 1469022 | 7.6% |
| 5 | 1371397 | 7.1% |
| 6 | 1317118 | 6.8% |
| 4 | 1310899 | 6.8% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 3852786 | 28.6% |
| 0 | 3852786 | 28.6% |
| _ | 3852786 | 28.6% |
| 1 | 1926393 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5779179 | 42.9% |
| Uppercase Letter | 3852786 | 28.6% |
| Connector Punctuation | 3852786 | 28.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3852786 | 66.7% |
| 1 | 1926393 | 33.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3852786 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3852786 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9631965 | 71.4% |
| Latin | 3852786 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3852786 | 40.0% |
| _ | 3852786 | 40.0% |
| 1 | 1926393 | 20.0% |
Latin
| Value | Count | Frequency (%) |
| C | 3852786 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13484751 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 3852786 | 28.6% |
| 0 | 3852786 | 28.6% |
| _ | 3852786 | 28.6% |
| 1 | 1926393 | 14.3% |
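The category counts for the constant license value can be reproduced by classifying each character of "CC0_1_0" and multiplying by the number of rows. The sketch below assumes the category labels correspond to Unicode general categories as exposed by Python's `unicodedata` module; how the profiling tool derives them internally is not stated in this report. `CATEGORY_NAMES` is an illustrative mapping covering only the codes that occur here.

```python
import unicodedata
from collections import Counter

# Display names for the general-category codes that occur in this value.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pc": "Connector Punctuation",
}

value = "CC0_1_0"      # the constant license value
n_rows = 1_926_393     # every row holds the same value

# Count characters per category in one value, then scale by the row count.
per_value = Counter(unicodedata.category(ch) for ch in value)
totals = {CATEGORY_NAMES[code]: n * n_rows for code, n in per_value.items()}

for name, count in totals.items():
    print(name, count)
```

The totals match the category table for license: three digits, two uppercase letters and two underscores per value.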
modified
Text
| Distinct | 113487 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 62369 |
|---|---|
| Unique (%) | 3.2% |
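The Distinct and Unique rows measure different things, which is why modified has 113487 distinct values but only 62369 unique ones. A minimal sketch of the distinction follows, assuming "distinct" counts different values and "unique" counts values that occur exactly once. The helper name and the four-element sample are illustrative only.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy sample built from timestamps that appear in this section.
sample = [
    "2024-09-25T13:44:00Z",
    "2024-09-25T13:44:00Z",
    "2021-10-06T15:29:00Z",
    "2018-09-17T12:46:00Z",
]
print(distinct_and_unique(sample))
```

Under the same reading, a constant column such as license has one distinct value and zero unique values, as the report shows.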
Sample
| 1st row | 2021-10-06T15:29:00Z |
|---|---|
| 2nd row | 2024-09-25T16:08:00Z |
| 3rd row | 2020-01-06T17:42:00Z |
| 4th row | 2018-09-17T12:46:00Z |
| 5th row | 2024-09-25T15:32:00Z |
| Value | Count | Frequency (%) |
| 2024-09-25t13:44:00z | 9049 | 0.5% |
| 2024-09-25t13:46:00z | 8728 | 0.5% |
| 2024-09-25t17:07:00z | 8646 | 0.4% |
| 2024-09-25t17:10:00z | 8633 | 0.4% |
| 2024-09-25t17:05:00z | 8623 | 0.4% |
| 2024-09-25t13:45:00z | 8553 | 0.4% |
| 2024-09-25t17:11:00z | 8500 | 0.4% |
| 2024-09-25t17:08:00z | 8494 | 0.4% |
| 2024-09-25t15:27:00z | 8472 | 0.4% |
| 2024-09-25t17:15:00z | 8471 | 0.4% |
| Other values (113477) | 1840224 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8971409 | |
| 2 | 4988502 | |
| 1 | 4688771 | |
| - | 3852786 | |
| : | 3852786 | |
| T | 1926393 | 5.0% |
| Z | 1926393 | 5.0% |
| 4 | 1757743 | 4.6% |
| 5 | 1702088 | 4.4% |
| 9 | 1536985 | 4.0% |
| Other values (4) | 3324004 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26969502 | |
| Dash Punctuation | 3852786 | 10.0% |
| Other Punctuation | 3852786 | 10.0% |
| Uppercase Letter | 3852786 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8971409 | |
| 2 | 4988502 | |
| 1 | 4688771 | |
| 4 | 1757743 | 6.5% |
| 5 | 1702088 | 6.3% |
| 9 | 1536985 | 5.7% |
| 3 | 1149855 | 4.3% |
| 7 | 807767 | 3.0% |
| 6 | 701085 | 2.6% |
| 8 | 665297 | 2.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1926393 | |
| Z | 1926393 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3852786 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852786 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 34675074 | |
| Latin | 3852786 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8971409 | |
| 2 | 4988502 | |
| 1 | 4688771 | |
| - | 3852786 | |
| : | 3852786 | |
| 4 | 1757743 | 5.1% |
| 5 | 1702088 | 4.9% |
| 9 | 1536985 | 4.4% |
| 3 | 1149855 | 3.3% |
| 7 | 807767 | 2.3% |
| Other values (2) | 1366382 | 3.9% |
Latin
| Value | Count | Frequency (%) |
| T | 1926393 | |
| Z | 1926393 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38527860 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8971409 | |
| 2 | 4988502 | |
| 1 | 4688771 | |
| - | 3852786 | |
| : | 3852786 | |
| T | 1926393 | 5.0% |
| Z | 1926393 | 5.0% |
| 4 | 1757743 | 4.6% |
| 5 | 1702088 | 4.4% |
| 9 | 1536985 | 4.0% |
| Other values (4) | 3324004 | 8.6% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 1926393 | |
| museum | 1926393 | |
| of | 1926393 | |
| natural | 1926393 | |
| history | 1926393 | |
| smithsonian | 1926393 | |
| institution | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 13484751 | |
| i | 11558358 | |
| (space) | 11558358 | 10.2% |
| a | 9631965 | 8.5% |
| o | 9631965 | 8.5% |
| n | 9631965 | 8.5% |
| s | 7705572 | 6.8% |
| u | 7705572 | 6.8% |
| r | 3852786 | 3.4% |
| m | 3852786 | 3.4% |
| Other values (11) | 25043109 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 88614078 | |
| Space Separator | 11558358 | 10.2% |
| Uppercase Letter | 11558358 | 10.2% |
| Other Punctuation | 1926393 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 13484751 | |
| i | 11558358 | |
| a | 9631965 | |
| o | 9631965 | |
| n | 9631965 | |
| s | 7705572 | |
| u | 7705572 | |
| r | 3852786 | 4.3% |
| m | 3852786 | 4.3% |
| l | 3852786 | 4.3% |
| Other values (4) | 7705572 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3852786 | |
| M | 1926393 | |
| H | 1926393 | |
| S | 1926393 | |
| I | 1926393 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11558358 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1926393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100172436 | |
| Common | 13484751 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 13484751 | |
| i | 11558358 | |
| a | 9631965 | |
| o | 9631965 | |
| n | 9631965 | |
| s | 7705572 | 7.7% |
| u | 7705572 | 7.7% |
| r | 3852786 | 3.8% |
| m | 3852786 | 3.8% |
| N | 3852786 | 3.8% |
| Other values (9) | 19263930 |
Common
| Value | Count | Frequency (%) |
| (space) | 11558358 | 85.7% |
| , | 1926393 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 113657187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 13484751 | |
| i | 11558358 | |
| (space) | 11558358 | 10.2% |
| a | 9631965 | 8.5% |
| o | 9631965 | 8.5% |
| n | 9631965 | 8.5% |
| s | 7705572 | 6.8% |
| u | 7705572 | 6.8% |
| r | 3852786 | 3.4% |
| m | 3852786 | 3.4% |
| Other values (11) | 25043109 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7705572 | |
| : | 7705572 | |
| l | 5779179 | 10.3% |
| i | 3852786 | 6.9% |
| r | 3852786 | 6.9% |
| c | 3852786 | 6.9% |
| g | 1926393 | 3.4% |
| 7 | 1926393 | 3.4% |
| 8 | 1926393 | 3.4% |
| 4 | 1926393 | 3.4% |
| Other values (8) | 15411144 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36601467 | |
| Other Punctuation | 9631965 | 17.2% |
| Decimal Number | 9631965 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7705572 | |
| l | 5779179 | |
| i | 3852786 | |
| r | 3852786 | |
| c | 3852786 | |
| g | 1926393 | 5.3% |
| u | 1926393 | 5.3% |
| b | 1926393 | 5.3% |
| d | 1926393 | 5.3% |
| s | 1926393 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1926393 | |
| 8 | 1926393 | |
| 4 | 1926393 | |
| 3 | 1926393 | |
| 1 | 1926393 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 7705572 | |
| . | 1926393 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36601467 | |
| Common | 19263930 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 7705572 | |
| l | 5779179 | |
| i | 3852786 | |
| r | 3852786 | |
| c | 3852786 | |
| g | 1926393 | 5.3% |
| u | 1926393 | 5.3% |
| b | 1926393 | 5.3% |
| d | 1926393 | 5.3% |
| s | 1926393 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 7705572 | |
| 7 | 1926393 | 10.0% |
| 8 | 1926393 | 10.0% |
| 4 | 1926393 | 10.0% |
| 3 | 1926393 | 10.0% |
| . | 1926393 | 10.0% |
| 1 | 1926393 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55865397 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 7705572 | |
| : | 7705572 | |
| l | 5779179 | 10.3% |
| i | 3852786 | 6.9% |
| r | 3852786 | 6.9% |
| c | 3852786 | 6.9% |
| g | 1926393 | 3.4% |
| 7 | 1926393 | 3.4% |
| 8 | 1926393 | 3.4% |
| 4 | 1926393 | 3.4% |
| Other values (8) | 15411144 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
|---|---|
| 2nd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 3rd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 4th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 5th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| Value | Count | Frequency (%) |
| urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 9631965 | |
| 1 | 7705572 | 8.9% |
| - | 7705572 | 8.9% |
| u | 5779179 | 6.7% |
| 8 | 5779179 | 6.7% |
| 2 | 5779179 | 6.7% |
| 4 | 5779179 | 6.7% |
| c | 5779179 | 6.7% |
| f | 5779179 | 6.7% |
| 9 | 3852786 | 4.4% |
| Other values (9) | 23116716 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40454253 | |
| Decimal Number | 34675074 | |
| Dash Punctuation | 7705572 | 8.9% |
| Other Punctuation | 3852786 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 9631965 | |
| u | 5779179 | |
| c | 5779179 | |
| f | 5779179 | |
| b | 3852786 | 9.5% |
| r | 1926393 | 4.8% |
| i | 1926393 | 4.8% |
| a | 1926393 | 4.8% |
| n | 1926393 | 4.8% |
| e | 1926393 | 4.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7705572 | |
| 8 | 5779179 | |
| 2 | 5779179 | |
| 4 | 5779179 | |
| 9 | 3852786 | |
| 7 | 3852786 | |
| 6 | 1926393 | 5.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7705572 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852786 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46233432 | |
| Latin | 40454253 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 9631965 | |
| u | 5779179 | |
| c | 5779179 | |
| f | 5779179 | |
| b | 3852786 | 9.5% |
| r | 1926393 | 4.8% |
| i | 1926393 | 4.8% |
| a | 1926393 | 4.8% |
| n | 1926393 | 4.8% |
| e | 1926393 | 4.8% |
Common
| Value | Count | Frequency (%) |
| 1 | 7705572 | |
| - | 7705572 | |
| 8 | 5779179 | |
| 2 | 5779179 | |
| 4 | 5779179 | |
| 9 | 3852786 | |
| : | 3852786 | |
| 7 | 3852786 | |
| 6 | 1926393 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86687685 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 9631965 | |
| 1 | 7705572 | 8.9% |
| - | 7705572 | 8.9% |
| u | 5779179 | 6.7% |
| 8 | 5779179 | 6.7% |
| 2 | 5779179 | 6.7% |
| 4 | 5779179 | 6.7% |
| c | 5779179 | 6.7% |
| f | 5779179 | 6.7% |
| 9 | 3852786 | 4.4% |
| Other values (9) | 23116716 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 1926393 | |
| S | 1926393 | |
| N | 1926393 | |
| M | 1926393 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7705572 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1926393 | |
| S | 1926393 | |
| N | 1926393 | |
| M | 1926393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7705572 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 1926393 | |
| S | 1926393 | |
| N | 1926393 | |
| M | 1926393 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7705572 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 1926393 | |
| S | 1926393 | |
| N | 1926393 | |
| M | 1926393 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IZ |
|---|---|
| 2nd row | IZ |
| 3rd row | IZ |
| 4th row | IZ |
| 5th row | IZ |
| Value | Count | Frequency (%) |
| iz | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1926393 | |
| Z | 1926393 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3852786 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1926393 | |
| Z | 1926393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3852786 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1926393 | |
| Z | 1926393 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3852786 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1926393 | |
| Z | 1926393 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 1926393 | |
| extant | 1926393 | |
| biology | 1926393 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 3852786 | 10.5% |
| (space) | 3852786 | 10.5% |
| t | 3852786 | 10.5% |
| o | 3852786 | 10.5% |
| M | 1926393 | 5.3% |
| H | 1926393 | 5.3% |
| E | 1926393 | 5.3% |
| x | 1926393 | 5.3% |
| a | 1926393 | 5.3% |
| n | 1926393 | 5.3% |
| Other values (5) | 9631965 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21190323 | |
| Uppercase Letter | 11558358 | |
| Space Separator | 3852786 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3852786 | |
| o | 3852786 | |
| x | 1926393 | |
| a | 1926393 | |
| n | 1926393 | |
| i | 1926393 | |
| l | 1926393 | |
| g | 1926393 | |
| y | 1926393 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3852786 | |
| M | 1926393 | |
| H | 1926393 | |
| E | 1926393 | |
| B | 1926393 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3852786 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32748681 | |
| Common | 3852786 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 3852786 | |
| t | 3852786 | |
| o | 3852786 | |
| M | 1926393 | 5.9% |
| H | 1926393 | 5.9% |
| E | 1926393 | 5.9% |
| x | 1926393 | 5.9% |
| a | 1926393 | 5.9% |
| n | 1926393 | 5.9% |
| B | 1926393 | 5.9% |
| Other values (4) | 7705572 |
Common
| Value | Count | Frequency (%) |
| (space) | 3852786 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36601467 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 3852786 | 10.5% |
| (space) | 3852786 | 10.5% |
| t | 3852786 | 10.5% |
| o | 3852786 | 10.5% |
| M | 1926393 | 5.3% |
| H | 1926393 | 5.3% |
| E | 1926393 | 5.3% |
| x | 1926393 | 5.3% |
| a | 1926393 | 5.3% |
| n | 1926393 | 5.3% |
| Other values (5) | 9631965 |
basisOfRecord
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 18.00144052 |
| Min length | 17 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 1922256 | |
| machine_observation | 3456 | 0.2% |
| human_observation | 681 | < 0.1% |
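The mean length of 18.00144052 reported for basisOfRecord follows from the three value counts above. The sketch below recomputes it as a count-weighted average of string lengths, using the uppercase spellings shown in the Sample table; the variable names are illustrative.

```python
# Value counts from the table above, keyed by the original uppercase values.
counts = {
    "PRESERVED_SPECIMEN": 1_922_256,
    "MACHINE_OBSERVATION": 3_456,
    "HUMAN_OBSERVATION": 681,
}

total_rows = sum(counts.values())
# Total characters: each value's length weighted by how often it occurs.
total_chars = sum(len(value) * n for value, n in counts.items())
mean_length = total_chars / total_rows

print(total_rows, total_chars, round(mean_length, 8))
```

The total character count from this calculation also equals the ASCII block total listed for this variable, 34677849.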
Most occurring characters
| Value | Count | Frequency (%) |
| E | 9618873 | |
| R | 3848649 | |
| S | 3848649 | |
| P | 3844512 | 11.1% |
| N | 1930530 | 5.6% |
| I | 1929849 | 5.6% |
| _ | 1926393 | 5.6% |
| M | 1926393 | 5.6% |
| V | 1926393 | 5.6% |
| C | 1925712 | 5.6% |
| Other values (7) | 1951896 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 32751456 | |
| Connector Punctuation | 1926393 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 9618873 | |
| R | 3848649 | |
| S | 3848649 | |
| P | 3844512 | 11.7% |
| N | 1930530 | 5.9% |
| I | 1929849 | 5.9% |
| M | 1926393 | 5.9% |
| V | 1926393 | 5.9% |
| C | 1925712 | 5.9% |
| D | 1922256 | 5.9% |
| Other values (6) | 29640 | 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1926393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32751456 | |
| Common | 1926393 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 9618873 | |
| R | 3848649 | |
| S | 3848649 | |
| P | 3844512 | 11.7% |
| N | 1930530 | 5.9% |
| I | 1929849 | 5.9% |
| M | 1926393 | 5.9% |
| V | 1926393 | 5.9% |
| C | 1925712 | 5.9% |
| D | 1922256 | 5.9% |
| Other values (6) | 29640 | 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 1926393 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34677849 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 9618873 | |
| R | 3848649 | |
| S | 3848649 | |
| P | 3844512 | 11.1% |
| N | 1930530 | 5.6% |
| I | 1929849 | 5.6% |
| _ | 1926393 | 5.6% |
| M | 1926393 | 5.6% |
| V | 1926393 | 5.6% |
| C | 1925712 | 5.6% |
| Other values (7) | 1951896 | 5.6% |
occurrenceID
Text
Unique 
| Distinct | 1926393 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 1926393 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3c831e8df-8799-47a1-8dcf-bcb0b77fd3e3 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/383ab647e-23a7-4086-b71e-36212ccc0eb2 |
| 3rd row | http://n2t.net/ark:/65665/383adbf6e-f769-4dc3-8bef-550530af49ee |
| 4th row | http://n2t.net/ark:/65665/3c83aad38-c935-46fa-96c3-e450ebb169cf |
| 5th row | http://n2t.net/ark:/65665/383b126a6-bf3a-4908-bc33-e4435555fcc5 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3c831e8df-8799-47a1-8dcf-bcb0b77fd3e3 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c8609028-15fe-4982-820a-6e4cef3b3db1 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383b126a6-bf3a-4908-bc33-e4435555fcc5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c843fd56-7874-4858-b938-14fdfcb5544c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383bcb698-5477-4feb-9966-d9adae345f09 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383bfd766-40bc-4ede-82ca-0df3775130f3 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c84cf22c-2b9b-49fb-91ed-f85efd9e9fa7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383cb8e2a-4f46-4138-82be-3d7989851c9e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c856104b-9825-44b9-8b57-e69b58510bf8 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c856ef4e-b135-45c8-8511-c533777f0d7a | 1 | < 0.1% |
| Other values (1926383) | 1926383 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 9631965 | 7.9% |
| 6 | 9394397 | 7.7% |
| - | 7705572 | 6.3% |
| t | 7705572 | 6.3% |
| 5 | 7461179 | 6.1% |
| a | 6018602 | 5.0% |
| 3 | 5539470 | 4.6% |
| e | 5537694 | 4.6% |
| 2 | 5537394 | 4.6% |
| 4 | 5534549 | 4.6% |
| Other values (16) | 51296365 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52493612 | |
| Lowercase Letter | 45752431 | |
| Other Punctuation | 15411144 | 12.7% |
| Dash Punctuation | 7705572 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 7705572 | |
| a | 6018602 | |
| e | 5537694 | |
| b | 4095432 | |
| n | 3852786 | |
| d | 3615504 | |
| c | 3611680 | |
| f | 3609589 | |
| k | 1926393 | 4.2% |
| r | 1926393 | 4.2% |
| Other values (2) | 3852786 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9394397 | |
| 5 | 7461179 | |
| 3 | 5539470 | |
| 2 | 5537394 | |
| 4 | 5534549 | |
| 8 | 4095501 | |
| 9 | 4095255 | |
| 1 | 3613792 | 6.9% |
| 7 | 3611338 | 6.9% |
| 0 | 3610737 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 9631965 | |
| : | 3852786 | 25.0% |
| . | 1926393 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7705572 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 75610328 | |
| Latin | 45752431 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 9631965 | |
| 6 | 9394397 | |
| - | 7705572 | |
| 5 | 7461179 | |
| 3 | 5539470 | |
| 2 | 5537394 | |
| 4 | 5534549 | |
| 8 | 4095501 | 5.4% |
| 9 | 4095255 | 5.4% |
| : | 3852786 | 5.1% |
| Other values (4) | 12762260 |
Latin
| Value | Count | Frequency (%) |
| t | 7705572 | |
| a | 6018602 | |
| e | 5537694 | |
| b | 4095432 | |
| n | 3852786 | |
| d | 3615504 | |
| c | 3611680 | |
| f | 3609589 | |
| k | 1926393 | 4.2% |
| r | 1926393 | 4.2% |
| Other values (2) | 3852786 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 121362759 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 9631965 | 7.9% |
| 6 | 9394397 | 7.7% |
| - | 7705572 | 6.3% |
| t | 7705572 | 6.3% |
| 5 | 7461179 | 6.1% |
| a | 6018602 | 5.0% |
| 3 | 5539470 | 4.6% |
| e | 5537694 | 4.6% |
| 2 | 5537394 | 4.6% |
| 4 | 5534549 | 4.6% |
| Other values (16) | 51296365 |
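The Length table above reports every occurrenceID as exactly 63 characters, and the sample rows all share the prefix `http://n2t.net/ark:/65665/3` followed by a UUID-shaped suffix. A minimal sketch of a shape check follows. The pattern is an assumption inferred from the five sample rows and the character categories listed above, not a published specification, and `is_well_formed_occurrence_id` is a hypothetical helper name.

```python
import re

# Assumed shape, inferred from the sample rows only:
# fixed ARK prefix, a literal "3", then a lowercase UUID-like suffix.
ARK_PATTERN = re.compile(
    r"http://n2t\.net/ark:/65665/3"
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"
)


def is_well_formed_occurrence_id(value: str) -> bool:
    """Return True when the value is 63 characters long and matches the assumed ARK shape."""
    return len(value) == 63 and ARK_PATTERN.fullmatch(value) is not None
```

Values that fail this check would be worth inspecting by hand before concluding they are malformed, since the pattern only encodes what the samples happen to show.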
catalogNumber
Text
| Distinct | 1355393 |
|---|---|
| Distinct (%) | 70.4% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.0374042 |
| Min length | 6 |
Unique
| Unique | 1024476 |
|---|---|
| Unique (%) | 53.2% |
Sample
| 1st row | USNM 1119015 |
|---|---|
| 2nd row | USNM 55168 |
| 3rd row | USNM 52536 |
| 4th row | USNM E40844 |
| 5th row | USNM 241160 |
| Value | Count | Frequency (%) |
| usnm | 1926388 | |
| | 31 | < 0.1% |
| 284908 | 16 | < 0.1% |
| 653324 | 13 | < 0.1% |
| 5357 | 11 | < 0.1% |
| 15490 | 10 | < 0.1% |
| 22869 | 10 | < 0.1% |
| 859036 | 10 | < 0.1% |
| 224878 | 10 | < 0.1% |
| 40969 | 9 | < 0.1% |
| Other values (1352149) | 1926301 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1928507 | 9.1% |
| U | 1926495 | 9.1% |
| (space) | 1926421 | 9.1% |
| S | 1926388 | 9.1% |
| N | 1926388 | 9.1% |
| 1 | 1809864 | 8.5% |
| 2 | 1247566 | 5.9% |
| 3 | 1147864 | 5.4% |
| 4 | 1110834 | 5.2% |
| 5 | 1088355 | 5.1% |
| Other values (53) | 5223641 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11557547 | |
| Uppercase Letter | 7763402 | |
| Space Separator | 1926421 | 9.1% |
| Lowercase Letter | 11690 | 0.1% |
| Other Punctuation | 3259 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8276 | |
| b | 1739 | 14.9% |
| c | 637 | 5.4% |
| d | 326 | 2.8% |
| e | 206 | 1.8% |
| f | 143 | 1.2% |
| g | 87 | 0.7% |
| h | 61 | 0.5% |
| i | 40 | 0.3% |
| j | 35 | 0.3% |
| Other values (16) | 140 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1928507 | |
| U | 1926495 | |
| S | 1926388 | |
| N | 1926388 | |
| E | 53455 | 0.7% |
| I | 778 | < 0.1% |
| A | 697 | < 0.1% |
| X | 326 | < 0.1% |
| B | 177 | < 0.1% |
| D | 128 | < 0.1% |
| Other values (10) | 63 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1809864 | |
| 2 | 1247566 | |
| 3 | 1147864 | |
| 4 | 1110834 | |
| 5 | 1088355 | |
| 8 | 1073474 | |
| 6 | 1062349 | |
| 7 | 1058933 | |
| 0 | 1002104 | |
| 9 | 956204 |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 3252 | |
| . | 6 | 0.2% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1926421 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13487231 | |
| Latin | 7775092 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1928507 | |
| U | 1926495 | |
| S | 1926388 | |
| N | 1926388 | |
| E | 53455 | 0.7% |
| a | 8276 | 0.1% |
| b | 1739 | < 0.1% |
| I | 778 | < 0.1% |
| A | 697 | < 0.1% |
| c | 637 | < 0.1% |
| Other values (36) | 1732 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1926421 | |
| 1 | 1809864 | |
| 2 | 1247566 | |
| 3 | 1147864 | |
| 4 | 1110834 | |
| 5 | 1088355 | |
| 8 | 1073474 | |
| 6 | 1062349 | |
| 7 | 1058933 | |
| 0 | 1002104 | |
| Other values (7) | 959467 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21262323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1928507 | 9.1% |
| U | 1926495 | 9.1% |
| (space) | 1926421 | 9.1% |
| S | 1926388 | 9.1% |
| N | 1926388 | 9.1% |
| 1 | 1809864 | 8.5% |
| 2 | 1247566 | 5.9% |
| 3 | 1147864 | 5.4% |
| 4 | 1110834 | 5.2% |
| 5 | 1088355 | 5.1% |
| Other values (53) | 5223641 |
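The sample rows for catalogNumber follow the shape `USNM <identifier>`, and the token `usnm` appears in all 1926388 non-missing values. A small sketch of splitting the institution prefix from the identifier is shown below. It assumes a single space separates the two parts, which the samples suggest but the report does not guarantee, and `split_catalog_number` is a hypothetical helper name.

```python
from typing import Optional, Tuple


def split_catalog_number(value: str) -> Tuple[Optional[str], str]:
    """Split 'USNM 1119015' into ('USNM', '1119015').

    Values without a space are returned with a prefix of None,
    so unusual entries can be reviewed separately.
    """
    prefix, separator, rest = value.strip().partition(" ")
    if not separator:
        return (None, prefix)
    return (prefix, rest.strip())
```

Because only 70.4% of values are distinct, the identifier part alone should not be treated as a unique key.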
recordNumber
Text
Missing 
| Distinct | 119495 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 1804640 |
| Missing (%) | 93.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 14 |
| Mean length | 13.17353166 |
| Min length | 1 |
Unique
| Unique | 118866 |
|---|---|
| Unique (%) | 97.6% |
Sample
| 1st row | USNPC # 001298 |
|---|---|
| 2nd row | FPlrv_430 |
| 3rd row | H-2284 |
| 4th row | USNPC # 066527 |
| 5th row | USNPC # 009815 |
| Value | Count | Frequency (%) |
| | 88145 | |
| usnpc | 88064 | |
| ullz | 5209 | 1.7% |
| rh | 1566 | 0.5% |
| k-rh | 1555 | 0.5% |
| ce16007-event | 223 | 0.1% |
| 2208 | 102 | < 0.1% |
| 1430 | 92 | < 0.1% |
| 1513 | 80 | < 0.1% |
| beauty | 75 | < 0.1% |
| Other values (119414) | 122317 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 185675 | 11.6% |
| 0 | 161175 | 10.0% |
| C | 97557 | 6.1% |
| S | 95231 | 5.9% |
| U | 94869 | 5.9% |
| P | 94146 | 5.9% |
| N | 93453 | 5.8% |
| # | 88221 | 5.5% |
| 1 | 83004 | 5.2% |
| 2 | 65151 | 4.1% |
| Other values (71) | 545435 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 709687 | |
| Uppercase Letter | 576534 | |
| Space Separator | 185675 | 11.6% |
| Other Punctuation | 91637 | 5.7% |
| Dash Punctuation | 15241 | 1.0% |
| Connector Punctuation | 14091 | 0.9% |
| Lowercase Letter | 10490 | 0.7% |
| Close Punctuation | 281 | < 0.1% |
| Open Punctuation | 271 | < 0.1% |
| Math Symbol | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 97557 | |
| S | 95231 | |
| U | 94869 | |
| P | 94146 | |
| N | 93453 | |
| L | 12317 | 2.1% |
| E | 11806 | 2.0% |
| R | 10316 | 1.8% |
| I | 7528 | 1.3% |
| B | 7241 | 1.3% |
| Other values (16) | 52070 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1416 | |
| v | 1363 | |
| a | 1349 | |
| r | 1268 | |
| t | 873 | |
| e | 713 | |
| s | 657 | 6.3% |
| n | 489 | 4.7% |
| c | 300 | 2.9% |
| i | 287 | 2.7% |
| Other values (16) | 1775 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 161175 | |
| 1 | 83004 | |
| 2 | 65151 | |
| 6 | 58928 | 8.3% |
| 3 | 58890 | 8.3% |
| 7 | 58489 | 8.2% |
| 4 | 56685 | 8.0% |
| 8 | 56225 | 7.9% |
| 9 | 55924 | 7.9% |
| 5 | 55216 | 7.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 88221 | |
| . | 2352 | 2.6% |
| : | 559 | 0.6% |
| , | 400 | 0.4% |
| ; | 65 | 0.1% |
| / | 20 | < 0.1% |
| & | 10 | < 0.1% |
| ? | 7 | < 0.1% |
| * | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15240 | |
| – | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 273 | |
| ] | 8 | 2.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 263 | |
| [ | 8 | 3.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 | |
| = | 5 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 185675 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 14091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1016893 | |
| Latin | 587024 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 97557 | |
| S | 95231 | |
| U | 94869 | |
| P | 94146 | |
| N | 93453 | |
| L | 12317 | 2.1% |
| E | 11806 | 2.0% |
| R | 10316 | 1.8% |
| I | 7528 | 1.3% |
| B | 7241 | 1.2% |
| Other values (42) | 62560 |
Common
| Value | Count | Frequency (%) |
| (space) | 185675 | |
| 0 | 161175 | |
| # | 88221 | |
| 1 | 83004 | |
| 2 | 65151 | 6.4% |
| 6 | 58928 | 5.8% |
| 3 | 58890 | 5.8% |
| 7 | 58489 | 5.8% |
| 4 | 56685 | 5.6% |
| 8 | 56225 | 5.5% |
| Other values (19) | 144450 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1603916 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 185675 | 11.6% |
| 0 | 161175 | 10.0% |
| C | 97557 | 6.1% |
| S | 95231 | 5.9% |
| U | 94869 | 5.9% |
| P | 94146 | 5.9% |
| N | 93453 | 5.8% |
| # | 88221 | 5.5% |
| 1 | 83004 | 5.2% |
| 2 | 65151 | 4.1% |
| Other values (70) | 545434 |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
recordedBy
Text
Missing 
| Distinct | 37540 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 764111 |
| Missing (%) | 39.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 24975 |
|---|---|
| Median length | 156 |
| Mean length | 23.05844881 |
| Min length | 1 |
Unique
| Unique | 16583 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | VIMS for BLM/ MMS |
|---|---|
| 2nd row | Lgl Ecological Research Associates/ Environmental Science And Engineering For BLM/ MMS |
| 3rd row | University of Southern California |
| 4th row | United States Fish Commission |
| 5th row | United States Fish Commission |
| Value | Count | Frequency (%) |
| mms | 181011 | 4.2% |
| blm | 181009 | 4.2% |
| for | 178053 | 4.2% |
| fish | 168374 | 3.9% |
| united | 164153 | 3.8% |
| states | 163489 | 3.8% |
| commission | 157086 | 3.7% |
| | 149581 | 3.5% |
| of | 101785 | 2.4% |
| j | 101464 | 2.4% |
| Other values (19944) | 2737862 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3119233 | 11.6% |
| e | 2082533 | 7.8% |
| i | 1879315 | 7.0% |
| n | 1616256 | 6.0% |
| t | 1592703 | 5.9% |
| o | 1549732 | 5.8% |
| s | 1530048 | 5.7% |
| a | 1499473 | 5.6% |
| r | 1221276 | 4.6% |
| M | 808831 | 3.0% |
| Other values (89) | 9901020 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17592737 | |
| Uppercase Letter | 4868630 | 18.2% |
| Space Separator | 3119233 | 11.6% |
| Other Punctuation | 1145135 | 4.3% |
| Dash Punctuation | 53764 | 0.2% |
| Decimal Number | 11194 | < 0.1% |
| Control | 7851 | < 0.1% |
| Open Punctuation | 688 | < 0.1% |
| Close Punctuation | 688 | < 0.1% |
| Connector Punctuation | 479 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2082533 | |
| i | 1879315 | |
| n | 1616256 | |
| t | 1592703 | |
| o | 1549732 | |
| s | 1530048 | |
| a | 1499473 | |
| r | 1221276 | 6.9% |
| l | 768101 | 4.4% |
| h | 563773 | 3.2% |
| Other values (31) | 3289527 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 808831 | |
| S | 654014 | |
| B | 397882 | 8.2% |
| C | 365204 | 7.5% |
| F | 349332 | 7.2% |
| L | 335906 | 6.9% |
| U | 267323 | 5.5% |
| H | 212626 | 4.4% |
| R | 189118 | 3.9% |
| W | 154368 | 3.2% |
| Other values (17) | 1134026 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 741943 | |
| / | 238289 | 20.8% |
| & | 118079 | 10.3% |
| , | 45735 | 4.0% |
| : | 572 | < 0.1% |
| ' | 383 | < 0.1% |
| ; | 79 | < 0.1% |
| " | 40 | < 0.1% |
| ? | 11 | < 0.1% |
| # | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2213 | |
| 1 | 1914 | |
| 0 | 1548 | |
| 9 | 1226 | |
| 4 | 936 | |
| 8 | 714 | 6.4% |
| 5 | 696 | 6.2% |
| 6 | 693 | 6.2% |
| 3 | 681 | 6.1% |
| 7 | 573 | 5.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 7816 | |
| (control character) | 35 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 686 | |
| { | 2 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 686 | |
| } | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3119233 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 53764 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 479 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22461367 | |
| Common | 4339053 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2082533 | 9.3% |
| i | 1879315 | 8.4% |
| n | 1616256 | 7.2% |
| t | 1592703 | 7.1% |
| o | 1549732 | 6.9% |
| s | 1530048 | 6.8% |
| a | 1499473 | 6.7% |
| r | 1221276 | 5.4% |
| M | 808831 | 3.6% |
| l | 768101 | 3.4% |
| Other values (58) | 7913099 |
Common
| Value | Count | Frequency (%) |
| (space) | 3119233 | |
| . | 741943 | 17.1% |
| / | 238289 | 5.5% |
| & | 118079 | 2.7% |
| - | 53764 | 1.2% |
| , | 45735 | 1.1% |
| (control character) | 7816 | 0.2% |
| 2 | 2213 | 0.1% |
| 1 | 1914 | < 0.1% |
| 0 | 1548 | < 0.1% |
| Other values (21) | 8519 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26799498 | |
| None | 922 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3119233 | 11.6% |
| e | 2082533 | 7.8% |
| i | 1879315 | 7.0% |
| n | 1616256 | 6.0% |
| t | 1592703 | 5.9% |
| o | 1549732 | 5.8% |
| s | 1530048 | 5.7% |
| a | 1499473 | 5.6% |
| r | 1221276 | 4.6% |
| M | 808831 | 3.0% |
| Other values (73) | 9900098 |
None
| Value | Count | Frequency (%) |
| é | 455 | |
| ü | 102 | 11.1% |
| á | 93 | 10.1% |
| ö | 65 | 7.0% |
| ä | 57 | 6.2% |
| ó | 53 | 5.7% |
| í | 49 | 5.3% |
| è | 15 | 1.6% |
| ñ | 12 | 1.3% |
| ç | 9 | 1.0% |
| Other values (6) | 12 | 1.3% |
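The category tables for recordedBy report 7851 characters in the Control category, alongside legitimate accented letters such as é and ü. One possible cleanup, sketched below, replaces control characters with a space and collapses the resulting whitespace while leaving accented letters untouched. The helper name `strip_control_characters` is hypothetical, and whether the control characters should be replaced or preserved (for example as list separators) is an assumption to check against the source data.

```python
import unicodedata


def strip_control_characters(text: str, replacement: str = " ") -> str:
    """Replace Unicode control characters (category 'Cc') and collapse whitespace runs."""
    cleaned = "".join(
        replacement if unicodedata.category(ch) == "Cc" else ch
        for ch in text
    )
    # split() with no arguments also trims leading and trailing whitespace.
    return " ".join(cleaned.split())
```

The very large maximum length (24975 against a mean of about 23) suggests a few outlier values that may deserve a manual look before any bulk cleanup.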
individualCount
Text
| Distinct | 1067 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 156 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.108392166 |
| Min length | 1 |
Unique
| Unique | 413 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 11 |
| 3rd row | 1 |
| 4th row | 26 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 995782 | |
| 2 | 289569 | 15.0% |
| 3 | 135771 | 7.0% |
| 4 | 99105 | 5.1% |
| 5 | 73928 | 3.8% |
| 6 | 51745 | 2.7% |
| 10 | 38953 | 2.0% |
| 7 | 31375 | 1.6% |
| 8 | 30170 | 1.6% |
| 9 | 18501 | 1.0% |
| Other values (1057) | 161338 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1131608 | |
| 2 | 345490 | 16.2% |
| 3 | 162143 | 7.6% |
| 4 | 118965 | 5.6% |
| 5 | 110284 | 5.2% |
| 0 | 93507 | 4.4% |
| 6 | 64569 | 3.0% |
| 7 | 42177 | 2.0% |
| 8 | 40055 | 1.9% |
| 9 | 26228 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2135026 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1131608 | |
| 2 | 345490 | 16.2% |
| 3 | 162143 | 7.6% |
| 4 | 118965 | 5.6% |
| 5 | 110284 | 5.2% |
| 0 | 93507 | 4.4% |
| 6 | 64569 | 3.0% |
| 7 | 42177 | 2.0% |
| 8 | 40055 | 1.9% |
| 9 | 26228 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2135026 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1131608 | |
| 2 | 345490 | 16.2% |
| 3 | 162143 | 7.6% |
| 4 | 118965 | 5.6% |
| 5 | 110284 | 5.2% |
| 0 | 93507 | 4.4% |
| 6 | 64569 | 3.0% |
| 7 | 42177 | 2.0% |
| 8 | 40055 | 1.9% |
| 9 | 26228 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2135026 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1131608 | |
| 2 | 345490 | 16.2% |
| 3 | 162143 | 7.6% |
| 4 | 118965 | 5.6% |
| 5 | 110284 | 5.2% |
| 0 | 93507 | 4.4% |
| 6 | 64569 | 3.0% |
| 7 | 42177 | 2.0% |
| 8 | 40055 | 1.9% |
| 9 | 26228 | 1.2% |
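individualCount is profiled as Text, yet every character in it falls in the Decimal Number category and the longest value is 5 characters. A hedged sketch of converting the column to integers is given below. `parse_individual_count` is a hypothetical helper, and returning `None` for anything that is not a plain ASCII digit string is a design assumption rather than a rule taken from the report.

```python
from typing import Optional


def parse_individual_count(value: Optional[str]) -> Optional[int]:
    """Convert a text count such as '26' to an int; return None for missing or non-numeric input."""
    if value is None:
        return None
    text = value.strip()
    # isascii() guards against Unicode digits that isdigit() accepts but int() rejects.
    if not (text.isascii() and text.isdigit()):
        return None
    return int(text)
```

Returning `None` instead of raising keeps the 156 missing cells and any stray values visible for later review.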
sex
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1802980 |
| Missing (%) | 93.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 5.129864763 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FEMALE |
|---|---|
| 2nd row | FEMALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | FEMALE |
| Value | Count | Frequency (%) |
| female | 68541 | |
| male | 54610 | |
| hermaphrodite | 262 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 192216 | |
| M | 123413 | |
| A | 123413 | |
| L | 123151 | |
| F | 68541 | 10.8% |
| H | 524 | 0.1% |
| R | 524 | 0.1% |
| P | 262 | < 0.1% |
| O | 262 | < 0.1% |
| D | 262 | < 0.1% |
| Other values (2) | 524 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 633092 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 192216 | |
| M | 123413 | |
| A | 123413 | |
| L | 123151 | |
| F | 68541 | 10.8% |
| H | 524 | 0.1% |
| R | 524 | 0.1% |
| P | 262 | < 0.1% |
| O | 262 | < 0.1% |
| D | 262 | < 0.1% |
| Other values (2) | 524 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 633092 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 192216 | |
| M | 123413 | |
| A | 123413 | |
| L | 123151 | |
| F | 68541 | 10.8% |
| H | 524 | 0.1% |
| R | 524 | 0.1% |
| P | 262 | < 0.1% |
| O | 262 | < 0.1% |
| D | 262 | < 0.1% |
| Other values (2) | 524 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 633092 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 192216 | |
| M | 123413 | |
| A | 123413 | |
| L | 123151 | |
| F | 68541 | 10.8% |
| H | 524 | 0.1% |
| R | 524 | 0.1% |
| P | 262 | < 0.1% |
| O | 262 | < 0.1% |
| D | 262 | < 0.1% |
| Other values (2) | 524 | 0.1% |
lifeStage
Text
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1888856 |
| Missing (%) | 98.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.544262994 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Larva |
|---|---|
| 2nd row | Juvenile |
| 3rd row | Larva |
| 4th row | Juvenile |
| 5th row | Larva |
| Value | Count | Frequency (%) |
| juvenile | 18119 | |
| adult | 9874 | |
| larva | 7695 | |
| immature | 711 | 1.9% |
| mature | 247 | 0.7% |
| subadult | 244 | 0.7% |
| egg | 142 | 0.4% |
| megalopa | 131 | 0.3% |
| veliger | 126 | 0.3% |
| zoea | 95 | 0.3% |
| Other values (9) | 153 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 37685 | |
| u | 29584 | |
| l | 28565 | |
| v | 25814 | |
| i | 18319 | |
| n | 18135 | |
| J | 18119 | |
| a | 17028 | |
| t | 11097 | 4.5% |
| d | 10135 | 4.1% |
| Other values (25) | 31171 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 208115 | |
| Uppercase Letter | 37537 | 15.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 37685 | |
| u | 29584 | |
| l | 28565 | |
| v | 25814 | |
| i | 18319 | |
| n | 18135 | |
| a | 17028 | |
| t | 11097 | 5.3% |
| d | 10135 | 4.9% |
| r | 8805 | 4.2% |
| Other values (11) | 2948 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 18119 | |
| A | 9874 | |
| L | 7695 | |
| I | 711 | 1.9% |
| M | 389 | 1.0% |
| S | 244 | 0.7% |
| E | 162 | 0.4% |
| V | 126 | 0.3% |
| Z | 95 | 0.3% |
| N | 87 | 0.2% |
| Other values (4) | 35 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 245652 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 37685 | |
| u | 29584 | |
| l | 28565 | |
| v | 25814 | |
| i | 18319 | |
| n | 18135 | |
| J | 18119 | |
| a | 17028 | |
| t | 11097 | 4.5% |
| d | 10135 | 4.1% |
| Other values (25) | 31171 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 245652 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 37685 | |
| u | 29584 | |
| l | 28565 | |
| v | 25814 | |
| i | 18319 | |
| n | 18135 | |
| J | 18119 | |
| a | 17028 | |
| t | 11097 | 4.5% |
| d | 10135 | 4.1% |
| Other values (25) | 31171 |
occurrenceStatus
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.997880495 |
| Min length | 6 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 1922302 | |
| absent | 4089 | 0.2% |
| 1993-09-09 | 1 | < 0.1% |
| 1938-09-22 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3848693 | |
| S | 1926391 | |
| N | 1926391 | |
| T | 1926391 | |
| P | 1922302 | |
| R | 1922302 | |
| A | 4089 | < 0.1% |
| B | 4089 | < 0.1% |
| 9 | 6 | < 0.1% |
| - | 4 | < 0.1% |
| Other values (5) | 10 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 13480648 | |
| Decimal Number | 16 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3848693 | |
| S | 1926391 | |
| N | 1926391 | |
| T | 1926391 | |
| P | 1922302 | |
| R | 1922302 | |
| A | 4089 | < 0.1% |
| B | 4089 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 6 | |
| 0 | 3 | |
| 1 | 2 | 12.5% |
| 3 | 2 | 12.5% |
| 2 | 2 | 12.5% |
| 8 | 1 | 6.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13480648 | |
| Common | 20 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3848693 | |
| S | 1926391 | |
| N | 1926391 | |
| T | 1926391 | |
| P | 1922302 | |
| R | 1922302 | |
| A | 4089 | < 0.1% |
| B | 4089 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 9 | 6 | |
| - | 4 | |
| 0 | 3 | |
| 1 | 2 | 10.0% |
| 3 | 2 | 10.0% |
| 2 | 2 | 10.0% |
| 8 | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13480668 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3848693 | |
| S | 1926391 | |
| N | 1926391 | |
| T | 1926391 | |
| P | 1922302 | |
| R | 1922302 | |
| A | 4089 | < 0.1% |
| B | 4089 | < 0.1% |
| 9 | 6 | < 0.1% |
| - | 4 | < 0.1% |
| Other values (5) | 10 | < 0.1% |
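The value table for occurrenceStatus lists two date-like strings (`1993-09-09` and `1938-09-22`) next to `PRESENT` and `ABSENT`, which suggests that a couple of rows had their columns shifted during parsing. A minimal sketch for flagging such values is shown below. The expected vocabulary is an assumption based on the two dominant values in this report, and `unexpected_status_values` is a hypothetical helper name.

```python
from collections import Counter
from typing import Dict, Iterable

# Assumed vocabulary, taken from the two dominant values in the report.
EXPECTED_STATUSES = {"PRESENT", "ABSENT"}


def unexpected_status_values(values: Iterable[str]) -> Dict[str, int]:
    """Count values that fall outside the expected occurrenceStatus vocabulary."""
    counts = Counter(
        value for value in values
        if value.strip().upper() not in EXPECTED_STATUSES
    )
    return dict(counts)
```

Rows flagged this way are good candidates for checking whether neighbouring columns in the same record are also displaced.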
preparations
Text
| Distinct | 527 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1860 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 167 |
|---|---|
| Median length | 157 |
| Mean length | 10.12228005 |
| Min length | 3 |
Unique
| Unique | 212 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Alcohol (Ethanol) |
|---|---|
| 2nd row | Dry |
| 3rd row | Alcohol (Ethanol) |
| 4th row | Dry |
| 5th row | Dry |
| Value | Count | Frequency (%) |
| ethanol | 907118 | |
| dry | 902342 | |
| alcohol | 897625 | |
| slide | 129646 | 4.4% |
| | 19548 | 0.7% |
| 95 | 16839 | 0.6% |
| formalin | 12585 | 0.4% |
| biorepository | 12373 | 0.4% |
| isopropyl | 10055 | 0.3% |
| sorting | 6036 | 0.2% |
| Other values (40) | 31872 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2866431 | |
| o | 2797187 | |
| h | 1806308 | 9.3% |
| (space) | 1021506 | 5.2% |
| r | 954329 | 4.9% |
| t | 939560 | 4.8% |
| n | 936854 | 4.8% |
| a | 925743 | 4.8% |
| y | 923987 | 4.7% |
| E | 913018 | 4.7% |
| Other values (43) | 5395739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13613226 | |
| Uppercase Letter | 2925118 | 15.0% |
| Space Separator | 1021506 | 5.2% |
| Close Punctuation | 887570 | 4.6% |
| Open Punctuation | 887570 | 4.6% |
| Other Punctuation | 86959 | 0.4% |
| Decimal Number | 39165 | 0.2% |
| Dash Punctuation | 19548 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2866431 | |
| o | 2797187 | |
| h | 1806308 | |
| r | 954329 | 7.0% |
| t | 939560 | 6.9% |
| n | 936854 | 6.9% |
| a | 925743 | 6.8% |
| y | 923987 | 6.8% |
| c | 898646 | 6.6% |
| i | 181357 | 1.3% |
| Other values (12) | 382824 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 913018 | |
| D | 902616 | |
| A | 898725 | |
| S | 153320 | 5.2% |
| I | 13804 | 0.5% |
| F | 12984 | 0.4% |
| B | 12731 | 0.4% |
| M | 5938 | 0.2% |
| R | 4592 | 0.2% |
| Y | 4591 | 0.2% |
| Other values (9) | 2799 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 18431 | |
| 5 | 17782 | |
| 0 | 1802 | 4.6% |
| 8 | 1081 | 2.8% |
| 1 | 36 | 0.1% |
| 2 | 33 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 67410 | |
| % | 19549 | 22.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1021506 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 887570 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 887570 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19548 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16538344 | |
| Common | 2942318 | 15.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2866431 | |
| o | 2797187 | |
| h | 1806308 | |
| r | 954329 | 5.8% |
| t | 939560 | 5.7% |
| n | 936854 | 5.7% |
| a | 925743 | 5.6% |
| y | 923987 | 5.6% |
| E | 913018 | 5.5% |
| D | 902616 | 5.5% |
| Other values (31) | 2572311 |
Common
| Value | Count | Frequency (%) |
| (space) | 1021506 | |
| ) | 887570 | |
| ( | 887570 | |
| ; | 67410 | 2.3% |
| % | 19549 | 0.7% |
| - | 19548 | 0.7% |
| 9 | 18431 | 0.6% |
| 5 | 17782 | 0.6% |
| 0 | 1802 | 0.1% |
| 8 | 1081 | < 0.1% |
| Other values (2) | 69 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19480662 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2866431 | |
| o | 2797187 | |
| h | 1806308 | 9.3% |
| (space) | 1021506 | 5.2% |
| r | 954329 | 4.9% |
| t | 939560 | 4.8% |
| n | 936854 | 4.8% |
| a | 925743 | 4.8% |
| y | 923987 | 4.7% |
| E | 913018 | 4.7% |
| Other values (43) | 5395739 |
disposition
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 252 |
|---|---|
| 2nd row | 265 |
| Value | Count | Frequency (%) |
| 252 | 1 | |
| 265 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 252 |
|---|---|
| 2nd row | 265 |
| Value | Count | Frequency (%) |
| 252 | 1 | |
| 265 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1993 |
|---|---|
| 2nd row | 1938 |
| Value | Count | Frequency (%) |
| 1993 | 1 | |
| 1938 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Missing 
| Distinct | 5098 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 1921269 |
| Missing (%) | 99.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 1349 |
|---|---|
| Median length | 49 |
| Mean length | 85.4980484 |
| Min length | 1 |
Unique
| Unique | 5082 |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=AY426351;https://www.ncbi.nlm.nih.gov/gquery?term=AY379442;https://www.ncbi.nlm.nih.gov/gquery?term=AY426385 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=MH825989 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=MT223244 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=MH826372 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=KT792656 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=km521547 | 12 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kx362316;https://www.ncbi.nlm.nih.gov/gquery?term=kx362269 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ef060028;https://www.ncbi.nlm.nih.gov/gquery?term=kx362271 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj172481 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=srr9613700 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kx832080 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mk246581;https://www.ncbi.nlm.nih.gov/gquery?term=mk246487 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jq307001 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay643524 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Other values (5088) | 5094 | 99.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 35419 | 8.1% |
| t | 26562 | 6.1% |
| / | 26562 | 6.1% |
| w | 26562 | 6.1% |
| n | 26562 | 6.1% |
| h | 17708 | 4.0% |
| r | 17708 | 4.0% |
| i | 17708 | 4.0% |
| e | 17708 | 4.0% |
| m | 17708 | 4.0% |
| Other values (51) | 207885 | 47.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 274474 | 62.7% |
| Other Punctuation | 83421 | 19.0% |
| Decimal Number | 53457 | 12.2% |
| Uppercase Letter | 17884 | 4.1% |
| Math Symbol | 8854 | 2.0% |
| Dash Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 3906 | 21.8% |
| M | 3764 | 21.0% |
| W | 1587 | 8.9% |
| U | 1539 | 8.6% |
| F | 833 | 4.7% |
| J | 772 | 4.3% |
| X | 719 | 4.0% |
| C | 697 | 3.9% |
| T | 538 | 3.0% |
| H | 533 | 3.0% |
| Other values (14) | 2996 | 16.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 26562 | 9.7% |
| w | 26562 | 9.7% |
| n | 26562 | 9.7% |
| h | 17708 | 6.5% |
| r | 17708 | 6.5% |
| i | 17708 | 6.5% |
| e | 17708 | 6.5% |
| m | 17708 | 6.5% |
| g | 17708 | 6.5% |
| q | 8854 | 3.2% |
| Other values (9) | 79686 | 29.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7334 | 13.7% |
| 8 | 6190 | 11.6% |
| 0 | 5590 | 10.5% |
| 4 | 5209 | 9.7% |
| 6 | 5207 | 9.7% |
| 5 | 5041 | 9.4% |
| 3 | 4920 | 9.2% |
| 9 | 4839 | 9.1% |
| 1 | 4744 | 8.9% |
| 7 | 4383 | 8.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 35419 | 42.5% |
| / | 26562 | 31.8% |
| ? | 8854 | 10.6% |
| : | 8854 | 10.6% |
| ; | 3732 | 4.5% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 8854 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292358 | 66.7% |
| Common | 145734 | 33.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 26562 | 9.1% |
| w | 26562 | 9.1% |
| n | 26562 | 9.1% |
| h | 17708 | 6.1% |
| r | 17708 | 6.1% |
| i | 17708 | 6.1% |
| e | 17708 | 6.1% |
| m | 17708 | 6.1% |
| g | 17708 | 6.1% |
| q | 8854 | 3.0% |
| Other values (33) | 97570 | 33.4% |
Common
| Value | Count | Frequency (%) |
| . | 35419 | 24.3% |
| / | 26562 | 18.2% |
| = | 8854 | 6.1% |
| ? | 8854 | 6.1% |
| : | 8854 | 6.1% |
| 2 | 7334 | 5.0% |
| 8 | 6190 | 4.2% |
| 0 | 5590 | 3.8% |
| 4 | 5209 | 3.6% |
| 6 | 5207 | 3.6% |
| Other values (8) | 27661 | 19.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 438092 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 35419 | 8.1% |
| t | 26562 | 6.1% |
| / | 26562 | 6.1% |
| w | 26562 | 6.1% |
| n | 26562 | 6.1% |
| h | 17708 | 4.0% |
| r | 17708 | 4.0% |
| i | 17708 | 4.0% |
| e | 17708 | 4.0% |
| m | 17708 | 4.0% |
| Other values (51) | 207885 | 47.5% |
associatedTaxa
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1.5 |
| Mean length | 1.5 |
| Min length | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 22 |
| Value | Count | Frequency (%) |
| 9 | 1 | 50.0% |
| 22 | 1 | 50.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | 66.7% |
| 9 | 1 | 33.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | 66.7% |
| 9 | 1 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | 66.7% |
| 9 | 1 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | 66.7% |
| 9 | 1 | 33.3% |
Missing 
| Distinct | 384906 |
|---|---|
| Distinct (%) | 49.2% |
| Missing | 1144485 |
| Missing (%) | 59.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 48983 |
|---|---|
| Median length | 1371 |
| Mean length | 61.51201036 |
| Min length | 1 |
Unique
| Unique | 322690 |
|---|---|
| Unique (%) | 41.3% |
Sample
| 1st row | Jewett.; Stearns. |
|---|---|
| 2nd row | Bartsch |
| 3rd row | 15 Nov. 1973; Jones, Dawson, del Rosario; Fitzgerald; NMNH-STRI Survey |
| 4th row | U. S. B. Fish |
| 5th row | C.R. Laws |
| Value | Count | Frequency (%) |
| coll | 143199 | 2.1% |
| of | 115369 | 1.7% |
| and | 111363 | 1.7% |
| a | 107288 | 1.6% |
| by | 89612 | 1.3% |
| (blank) | 87811 | 1.3% |
| 2 | 65618 | 1.0% |
| 3 | 63129 | 0.9% |
| was | 62154 | 0.9% |
| formalin | 58892 | 0.9% |
| Other values (238105) | 5777747 | 86.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 5892887 | 12.3% |
| e | 2965997 | 6.2% |
| o | 2602001 | 5.4% |
| a | 2414749 | 5.0% |
| i | 2010061 | 4.2% |
| t | 1978195 | 4.1% |
| n | 1975689 | 4.1% |
| r | 1877425 | 3.9% |
| s | 1858443 | 3.9% |
| l | 1812957 | 3.8% |
| Other values (123) | 22708329 | 47.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27332023 | 56.8% |
| Space Separator | 5892887 | 12.3% |
| Uppercase Letter | 5695540 | 11.8% |
| Other Punctuation | 5002846 | 10.4% |
| Decimal Number | 3447946 | 7.2% |
| Dash Punctuation | 299803 | 0.6% |
| Open Punctuation | 185687 | 0.4% |
| Close Punctuation | 185536 | 0.4% |
| Control | 24753 | 0.1% |
| Math Symbol | 15128 | < 0.1% |
| Other values (8) | 14584 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2965997 | 10.9% |
| o | 2602001 | 9.5% |
| a | 2414749 | 8.8% |
| i | 2010061 | 7.4% |
| t | 1978195 | 7.2% |
| n | 1975689 | 7.2% |
| r | 1877425 | 6.9% |
| s | 1858443 | 6.8% |
| l | 1812957 | 6.6% |
| d | 1161749 | 4.3% |
| Other values (32) | 6674757 | 24.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 697325 | 12.2% |
| S | 676219 | 11.9% |
| B | 359074 | 6.3% |
| F | 347516 | 6.1% |
| P | 326048 | 5.7% |
| N | 312938 | 5.5% |
| M | 290198 | 5.1% |
| A | 263171 | 4.6% |
| R | 240076 | 4.2% |
| H | 232062 | 4.1% |
| Other values (17) | 1950913 | 34.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 1194440 | 23.9% |
| . | 1192250 | 23.8% |
| ; | 1044076 | 20.9% |
| , | 582555 | 11.6% |
| : | 568537 | 11.4% |
| % | 166891 | 3.3% |
| / | 97157 | 1.9% |
| ! | 65397 | 1.3% |
| ' | 33800 | 0.7% |
| # | 25850 | 0.5% |
| Other values (6) | 31893 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 686584 | 19.9% |
| 2 | 449189 | 13.0% |
| 9 | 387732 | 11.2% |
| 0 | 371791 | 10.8% |
| 3 | 303064 | 8.8% |
| 7 | 287058 | 8.3% |
| 5 | 256778 | 7.4% |
| 6 | 251883 | 7.3% |
| 4 | 239793 | 7.0% |
| 8 | 214074 | 6.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 11235 | 74.3% |
| = | 1994 | 13.2% |
| | | 1638 | 10.8% |
| > | 140 | 0.9% |
| ~ | 94 | 0.6% |
| < | 23 | 0.2% |
| ± | 2 | < 0.1% |
| × | 2 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3563 | 96.0% |
| ♂ | 91 | 2.5% |
| ♀ | 49 | 1.3% |
| ⚥ | 6 | 0.2% |
| © | 2 | 0.1% |
| ® | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 299333 | 99.8% |
| – | 469 | 0.2% |
| — | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 95227 | 51.3% |
| { | 87819 | 47.3% |
| [ | 2641 | 1.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 95098 | 51.3% |
| } | 87813 | 47.3% |
| ] | 2625 | 1.4% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 | 33.3% |
| ¼ | 1 | 33.3% |
| ³ | 1 | 33.3% |
Control
| Value | Count | Frequency (%) |
| (control character) | 24642 | 99.6% |
| (control character) | 111 | 0.4% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 383 | 99.5% |
| € | 2 | 0.5% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 213 | 99.5% |
| » | 1 | 0.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 213 | 99.5% |
| « | 1 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5892887 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8926 | 100.0% |
Other Letter
| Value | Count | Frequency (%) |
| º | 1128 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33028655 | 68.7% |
| Common | 15068070 | 31.3% |
| Greek | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2965997 | 9.0% |
| o | 2602001 | 7.9% |
| a | 2414749 | 7.3% |
| i | 2010061 | 6.1% |
| t | 1978195 | 6.0% |
| n | 1975689 | 6.0% |
| r | 1877425 | 5.7% |
| s | 1858443 | 5.6% |
| l | 1812957 | 5.5% |
| d | 1161749 | 3.5% |
| Other values (57) | 12371389 | 37.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 5892887 | 39.1% |
| " | 1194440 | 7.9% |
| . | 1192250 | 7.9% |
| ; | 1044076 | 6.9% |
| 1 | 686584 | 4.6% |
| , | 582555 | 3.9% |
| : | 568537 | 3.8% |
| 2 | 449189 | 3.0% |
| 9 | 387732 | 2.6% |
| 0 | 371791 | 2.5% |
| Other values (54) | 2698029 | 17.9% |
Greek
| Value | Count | Frequency (%) |
| μ | 7 | 87.5% |
| π | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48090219 | > 99.9% |
| None | 5328 | < 0.1% |
| Punctuation | 1038 | < 0.1% |
| Misc Symbols | 146 | < 0.1% |
| Currency Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 5892887 | 12.3% |
| e | 2965997 | 6.2% |
| o | 2602001 | 5.4% |
| a | 2414749 | 5.0% |
| i | 2010061 | 4.2% |
| t | 1978195 | 4.1% |
| n | 1975689 | 4.1% |
| r | 1877425 | 3.9% |
| s | 1858443 | 3.9% |
| l | 1812957 | 3.8% |
| Other values (86) | 22701815 | 47.2% |
None
| Value | Count | Frequency (%) |
| ° | 3563 | 66.9% |
| º | 1128 | 21.2% |
| é | 388 | 7.3% |
| ü | 91 | 1.7% |
| ö | 31 | 0.6% |
| µ | 28 | 0.5% |
| ã | 14 | 0.3% |
| à | 12 | 0.2% |
| ó | 11 | 0.2% |
| á | 11 | 0.2% |
| Other values (18) | 51 | 1.0% |
Punctuation
| Value | Count | Frequency (%) |
| – | 469 | 45.2% |
| ” | 213 | 20.5% |
| “ | 213 | 20.5% |
| … | 142 | 13.7% |
| — | 1 | 0.1% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 91 | 62.3% |
| ♀ | 49 | 33.6% |
| ⚥ | 6 | 4.1% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 2 | 100.0% |
verbatimLabel
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 48.5 |
| Mean length | 48.5 |
| Min length | 35 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, North Pacific Ocean, Gulf Of California, Mexico |
|---|---|
| 2nd row | North America, United States, Texas |
| Value | Count | Frequency (%) |
| north | 3 | 21.4% |
| america | 2 | 14.3% |
| pacific | 1 | 7.1% |
| ocean | 1 | 7.1% |
| gulf | 1 | 7.1% |
| of | 1 | 7.1% |
| california | 1 | 7.1% |
| mexico | 1 | 7.1% |
| united | 1 | 7.1% |
| states | 1 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 12 | 12.4% |
| a | 8 | 8.2% |
| i | 8 | 8.2% |
| e | 7 | 7.2% |
| r | 6 | 6.2% |
| t | 6 | 6.2% |
| c | 6 | 6.2% |
| o | 5 | 5.2% |
| , | 5 | 5.2% |
| f | 4 | 4.1% |
| Other values (18) | 30 | 30.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66 | 68.0% |
| Uppercase Letter | 14 | 14.4% |
| Space Separator | 12 | 12.4% |
| Other Punctuation | 5 | 5.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | 12.1% |
| i | 8 | 12.1% |
| e | 7 | 10.6% |
| r | 6 | 9.1% |
| t | 6 | 9.1% |
| c | 6 | 9.1% |
| o | 5 | 7.6% |
| f | 4 | 6.1% |
| n | 3 | 4.5% |
| h | 3 | 4.5% |
| Other values (6) | 10 | 15.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3 | 21.4% |
| A | 2 | 14.3% |
| O | 2 | 14.3% |
| P | 1 | 7.1% |
| G | 1 | 7.1% |
| C | 1 | 7.1% |
| M | 1 | 7.1% |
| U | 1 | 7.1% |
| S | 1 | 7.1% |
| T | 1 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80 | 82.5% |
| Common | 17 | 17.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | 10.0% |
| i | 8 | 10.0% |
| e | 7 | 8.8% |
| r | 6 | 7.5% |
| t | 6 | 7.5% |
| c | 6 | 7.5% |
| o | 5 | 6.2% |
| f | 4 | 5.0% |
| n | 3 | 3.8% |
| N | 3 | 3.8% |
| Other values (16) | 24 | 30.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 12 | 70.6% |
| , | 5 | 29.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 12 | 12.4% |
| a | 8 | 8.2% |
| i | 8 | 8.2% |
| e | 7 | 7.2% |
| r | 6 | 6.2% |
| t | 6 | 6.2% |
| c | 6 | 6.2% |
| o | 5 | 5.2% |
| , | 5 | 5.2% |
| f | 4 | 4.1% |
| Other values (18) | 30 | 30.9% |
materialSampleID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 2 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 4 | 15.4% |
| A | 4 | 15.4% |
| N | 2 | 7.7% |
| O | 2 | 7.7% |
| T | 2 | 7.7% |
| H | 2 | 7.7% |
| _ | 2 | 7.7% |
| M | 2 | 7.7% |
| E | 2 | 7.7% |
| I | 2 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 24 | 92.3% |
| Connector Punctuation | 2 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 4 | 16.7% |
| A | 4 | 16.7% |
| N | 2 | 8.3% |
| O | 2 | 8.3% |
| T | 2 | 8.3% |
| H | 2 | 8.3% |
| M | 2 | 8.3% |
| E | 2 | 8.3% |
| I | 2 | 8.3% |
| C | 2 | 8.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24 | 92.3% |
| Common | 2 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 4 | 16.7% |
| A | 4 | 16.7% |
| N | 2 | 8.3% |
| O | 2 | 8.3% |
| T | 2 | 8.3% |
| H | 2 | 8.3% |
| M | 2 | 8.3% |
| E | 2 | 8.3% |
| I | 2 | 8.3% |
| C | 2 | 8.3% |
Common
| Value | Count | Frequency (%) |
| _ | 2 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 4 | 15.4% |
| A | 4 | 15.4% |
| N | 2 | 7.7% |
| O | 2 | 7.7% |
| T | 2 | 7.7% |
| H | 2 | 7.7% |
| _ | 2 | 7.7% |
| M | 2 | 7.7% |
| E | 2 | 7.7% |
| I | 2 | 7.7% |
eventID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 39 |
| Mean length | 39 |
| Min length | 39 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North Pacific Ocean, Gulf Of California |
|---|
| Value | Count | Frequency (%) |
| north | 1 | 16.7% |
| pacific | 1 | 16.7% |
| ocean | 1 | 16.7% |
| gulf | 1 | 16.7% |
| of | 1 | 16.7% |
| california | 1 | 16.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 5 | 12.8% |
| i | 4 | 10.3% |
| a | 4 | 10.3% |
| f | 4 | 10.3% |
| c | 3 | 7.7% |
| n | 2 | 5.1% |
| r | 2 | 5.1% |
| l | 2 | 5.1% |
| o | 2 | 5.1% |
| O | 2 | 5.1% |
| Other values (9) | 9 | 23.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27 | 69.2% |
| Uppercase Letter | 6 | 15.4% |
| Space Separator | 5 | 12.8% |
| Other Punctuation | 1 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | 14.8% |
| a | 4 | 14.8% |
| f | 4 | 14.8% |
| c | 3 | 11.1% |
| n | 2 | 7.4% |
| r | 2 | 7.4% |
| l | 2 | 7.4% |
| o | 2 | 7.4% |
| u | 1 | 3.7% |
| e | 1 | 3.7% |
| Other values (2) | 2 | 7.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2 | 33.3% |
| G | 1 | 16.7% |
| N | 1 | 16.7% |
| P | 1 | 16.7% |
| C | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33 | 84.6% |
| Common | 6 | 15.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | 12.1% |
| a | 4 | 12.1% |
| f | 4 | 12.1% |
| c | 3 | 9.1% |
| n | 2 | 6.1% |
| r | 2 | 6.1% |
| l | 2 | 6.1% |
| o | 2 | 6.1% |
| O | 2 | 6.1% |
| u | 1 | 3.0% |
| Other values (7) | 7 | 21.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 5 | 83.3% |
| , | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 5 | 12.8% |
| i | 4 | 10.3% |
| a | 4 | 10.3% |
| f | 4 | 10.3% |
| c | 3 | 7.7% |
| n | 2 | 5.1% |
| r | 2 | 5.1% |
| l | 2 | 5.1% |
| o | 2 | 5.1% |
| O | 2 | 5.1% |
| Other values (9) | 9 | 23.1% |
fieldNumber
Text
Missing 
| Distinct | 62652 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 1339759 |
| Missing (%) | 69.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 111 |
|---|---|
| Median length | 63 |
| Mean length | 13.61565474 |
| Min length | 1 |
Unique
| Unique | 27490 |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | MMS-CABP/02B-E4 |
|---|---|
| 2nd row | 4/III-23-TDS |
| 3rd row | USARP/EL/12/1002/USC |
| 4th row | USFC/A2059 |
| 5th row | USFC/A5374 |
| Value | Count | Frequency (%) |
| mms-mafla/jar | 17292 | 2.6% |
| bolland/rfb | 7605 | 1.1% |
| humes | 5243 | 0.8% |
| jpem | 5029 | 0.8% |
| (blank) | 4975 | 0.8% |
| rh | 2306 | 0.3% |
| k-rh | 1557 | 0.2% |
| spm | 1164 | 0.2% |
| mnhn-norfolk | 1131 | 0.2% |
| haul | 1040 | 0.2% |
| Other values (59086) | 614438 | 92.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 742746 | 9.3% |
| S | 650690 | 8.1% |
| M | 501374 | 6.3% |
| - | 480058 | 6.0% |
| A | 421866 | 5.3% |
| 1 | 403237 | 5.0% |
| 0 | 377832 | 4.7% |
| C | 368160 | 4.6% |
| 2 | 360968 | 4.5% |
| U | 266532 | 3.3% |
| Other values (72) | 3413943 | 42.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3902312 | 48.9% |
| Decimal Number | 2536931 | 31.8% |
| Other Punctuation | 835673 | 10.5% |
| Dash Punctuation | 480058 | 6.0% |
| Lowercase Letter | 145893 | 1.8% |
| Space Separator | 75146 | 0.9% |
| Connector Punctuation | 7573 | 0.1% |
| Open Punctuation | 1756 | < 0.1% |
| Close Punctuation | 1756 | < 0.1% |
| Math Symbol | 302 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 650690 | 16.7% |
| M | 501374 | 12.8% |
| A | 421866 | 10.8% |
| C | 368160 | 9.4% |
| U | 266532 | 6.8% |
| F | 236190 | 6.1% |
| I | 186859 | 4.8% |
| R | 170619 | 4.4% |
| L | 169981 | 4.4% |
| P | 165609 | 4.2% |
| Other values (16) | 764432 | 19.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 25305 | 17.3% |
| r | 24955 | 17.1% |
| a | 23105 | 15.8% |
| l | 9450 | 6.5% |
| s | 8105 | 5.6% |
| i | 7884 | 5.4% |
| o | 7864 | 5.4% |
| u | 7559 | 5.2% |
| m | 5786 | 4.0% |
| t | 4694 | 3.2% |
| Other values (16) | 21186 | 14.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 742746 | 88.9% |
| : | 80860 | 9.7% |
| . | 4233 | 0.5% |
| ; | 3671 | 0.4% |
| , | 2634 | 0.3% |
| # | 938 | 0.1% |
| \ | 340 | < 0.1% |
| ? | 150 | < 0.1% |
| & | 61 | < 0.1% |
| " | 16 | < 0.1% |
| Other values (2) | 24 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 403237 | 15.9% |
| 0 | 377832 | 14.9% |
| 2 | 360968 | 14.2% |
| 5 | 260750 | 10.3% |
| 3 | 252386 | 9.9% |
| 4 | 217309 | 8.6% |
| 7 | 192322 | 7.6% |
| 6 | 178170 | 7.0% |
| 8 | 164692 | 6.5% |
| 9 | 129265 | 5.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 290 | 96.0% |
| = | 12 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 480058 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 75146 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7573 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1756 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1756 | 100.0% |
Control
| Value | Count | Frequency (%) |
| (control character) | 6 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4048205 | 50.7% |
| Common | 3939201 | 49.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 650690 | 16.1% |
| M | 501374 | 12.4% |
| A | 421866 | 10.4% |
| C | 368160 | 9.1% |
| U | 266532 | 6.6% |
| F | 236190 | 5.8% |
| I | 186859 | 4.6% |
| R | 170619 | 4.2% |
| L | 169981 | 4.2% |
| P | 165609 | 4.1% |
| Other values (42) | 910325 | 22.5% |
Common
| Value | Count | Frequency (%) |
| / | 742746 | 18.9% |
| - | 480058 | 12.2% |
| 1 | 403237 | 10.2% |
| 0 | 377832 | 9.6% |
| 2 | 360968 | 9.2% |
| 5 | 260750 | 6.6% |
| 3 | 252386 | 6.4% |
| 4 | 217309 | 5.5% |
| 7 | 192322 | 4.9% |
| 6 | 178170 | 4.5% |
| Other values (20) | 473423 | 12.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7987406 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 742746 | 9.3% |
| S | 650690 | 8.1% |
| M | 501374 | 6.3% |
| - | 480058 | 6.0% |
| A | 421866 | 5.3% |
| 1 | 403237 | 5.0% |
| 0 | 377832 | 4.7% |
| C | 368160 | 4.6% |
| 2 | 360968 | 4.5% |
| U | 266532 | 3.3% |
| Other values (72) | 3413943 | 42.7% |
eventDate
Text
Missing 
| Distinct | 45561 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 688611 |
| Missing (%) | 35.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.825816662 |
| Min length | 4 |
Unique
| Unique | 6824 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 1976-03-03 |
|---|---|
| 2nd row | 1984-05-15 |
| 3rd row | 1964-03-15 |
| 4th row | 1883-08-31 |
| 5th row | 1909-03-02 |
| Value | Count | Frequency (%) |
| 1915 | 6254 | 0.5% |
| 1982-07-21 | 5684 | 0.5% |
| 1981-07-06 | 5412 | 0.4% |
| 1983-05-13 | 5155 | 0.4% |
| 1982-11-19 | 5039 | 0.4% |
| 1982-02-10 | 4461 | 0.4% |
| 1981-11-09 | 4297 | 0.3% |
| 1913 | 4293 | 0.3% |
| 1982-05-10 | 4269 | 0.3% |
| 1977-01-28/1977-02-13 | 3795 | 0.3% |
| Other values (45551) | 1189123 | 96.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2343420 | 19.3% |
| - | 2329130 | 19.2% |
| 0 | 1804499 | 14.8% |
| 9 | 1499550 | 12.3% |
| 2 | 828832 | 6.8% |
| 8 | 778911 | 6.4% |
| 7 | 716498 | 5.9% |
| 6 | 564568 | 4.6% |
| 5 | 436405 | 3.6% |
| 3 | 431150 | 3.5% |
| Other values (7) | 429256 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9788344 | 80.5% |
| Dash Punctuation | 2329130 | 19.2% |
| Other Punctuation | 44740 | 0.4% |
| Lowercase Letter | 4 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2343420 | 23.9% |
| 0 | 1804499 | 18.4% |
| 9 | 1499550 | 15.3% |
| 2 | 828832 | 8.5% |
| 8 | 778911 | 8.0% |
| 7 | 716498 | 7.3% |
| 6 | 564568 | 5.8% |
| 5 | 436405 | 4.5% |
| 3 | 431150 | 4.4% |
| 4 | 384511 | 3.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1 | 25.0% |
| x | 1 | 25.0% |
| a | 1 | 25.0% |
| s | 1 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2329130 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 44740 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12162214 | > 99.9% |
| Latin | 5 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2343420 | 19.3% |
| - | 2329130 | 19.2% |
| 0 | 1804499 | 14.8% |
| 9 | 1499550 | 12.3% |
| 2 | 828832 | 6.8% |
| 8 | 778911 | 6.4% |
| 7 | 716498 | 5.9% |
| 6 | 564568 | 4.6% |
| 5 | 436405 | 3.6% |
| 3 | 431150 | 3.5% |
| Other values (2) | 429251 | 3.5% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | 20.0% |
| e | 1 | 20.0% |
| x | 1 | 20.0% |
| a | 1 | 20.0% |
| s | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12162219 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2343420 | 19.3% |
| - | 2329130 | 19.2% |
| 0 | 1804499 | 14.8% |
| 9 | 1499550 | 12.3% |
| 2 | 828832 | 6.8% |
| 8 | 778911 | 6.4% |
| 7 | 716498 | 5.9% |
| 6 | 564568 | 4.6% |
| 5 | 436405 | 3.6% |
| 3 | 431150 | 3.5% |
| Other values (7) | 429256 | 3.5% |
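The eventDate values above take several shapes: a bare year (1915), a full date (1976-03-03), and a slash-separated interval (1977-01-28/1977-02-13). The character tables also list the letters T, e, x, a and s once each, which suggests at least one non-date value in the column. The following is a minimal sketch of how such strings could be normalized into a start and end date before analysis. The helper names `parse_event_date` and `_bounds` are hypothetical; they are not part of the dataset or of the profiling tool, and the handling of year-month values is an assumption, since none appear in the samples.

```python
import calendar
from datetime import date


def _bounds(part):
    """Expand 'YYYY', 'YYYY-MM' or 'YYYY-MM-DD' to its first and last day."""
    nums = [int(p) for p in part.split("-")]  # ValueError on non-numeric text
    year = nums[0]
    if len(nums) == 1:  # year only
        return date(year, 1, 1), date(year, 12, 31)
    month = nums[1]
    if len(nums) == 2:  # year and month (assumed shape, not seen in the samples)
        last_day = calendar.monthrange(year, month)[1]
        return date(year, month, 1), date(year, month, last_day)
    if len(nums) == 3:  # full date
        day = date(year, month, nums[2])
        return day, day
    raise ValueError(part)


def parse_event_date(value):
    """Return (start, end) dates, or None when the value is not date-like."""
    try:
        parts = value.strip().split("/")
        if len(parts) == 1:
            return _bounds(parts[0])
        if len(parts) == 2:  # interval: start of the first part, end of the second
            return _bounds(parts[0])[0], _bounds(parts[1])[1]
    except ValueError:
        pass
    return None


print(parse_event_date("1977-01-28/1977-02-13"))
print(parse_event_date("Texas"))
```

Returning `None` for unparseable strings keeps stray values visible for review instead of silently coercing them.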
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 842313 |
| Missing (%) | 43.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.737319202 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 63 |
|---|---|
| 2nd row | 136 |
| 3rd row | 75 |
| 4th row | 243 |
| 5th row | 61 |
| Value | Count | Frequency (%) |
| 202 | 9215 | 0.9% |
| 133 | 9048 | 0.8% |
| 187 | 8343 | 0.8% |
| 130 | 7952 | 0.7% |
| 323 | 7925 | 0.7% |
| 41 | 7863 | 0.7% |
| 145 | 7055 | 0.7% |
| 313 | 6543 | 0.6% |
| 175 | 6524 | 0.6% |
| 263 | 6356 | 0.6% |
| Other values (356) | 1007256 | 92.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 581036 | 19.6% |
| 2 | 554744 | 18.7% |
| 3 | 415233 | 14.0% |
| 4 | 233074 | 7.9% |
| 5 | 216424 | 7.3% |
| 0 | 206942 | 7.0% |
| 6 | 200957 | 6.8% |
| 9 | 195331 | 6.6% |
| 7 | 188977 | 6.4% |
| 8 | 174755 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2967473 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 581036 | 19.6% |
| 2 | 554744 | 18.7% |
| 3 | 415233 | 14.0% |
| 4 | 233074 | 7.9% |
| 5 | 216424 | 7.3% |
| 0 | 206942 | 7.0% |
| 6 | 200957 | 6.8% |
| 9 | 195331 | 6.6% |
| 7 | 188977 | 6.4% |
| 8 | 174755 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2967473 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 581036 | 19.6% |
| 2 | 554744 | 18.7% |
| 3 | 415233 | 14.0% |
| 4 | 233074 | 7.9% |
| 5 | 216424 | 7.3% |
| 0 | 206942 | 7.0% |
| 6 | 200957 | 6.8% |
| 9 | 195331 | 6.6% |
| 7 | 188977 | 6.4% |
| 8 | 174755 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2967473 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 581036 | 19.6% |
| 2 | 554744 | 18.7% |
| 3 | 415233 | 14.0% |
| 4 | 233074 | 7.9% |
| 5 | 216424 | 7.3% |
| 0 | 206942 | 7.0% |
| 6 | 200957 | 6.8% |
| 9 | 195331 | 6.6% |
| 7 | 188977 | 6.4% |
| 8 | 174755 | 5.9% |
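The five sample rows shown for eventDate (1976-03-03, 1984-05-15, 1964-03-15, 1883-08-31, 1909-03-02) and for startDayOfYear (63, 136, 75, 243, 61) appear to come from the same five records, although the report does not say so. Under that assumption, a short sketch like the one below can check that the day-of-year column agrees with the date column. The function name `day_of_year` and the pairing of the samples are illustrative assumptions, not part of the report.

```python
from datetime import date


def day_of_year(iso_date):
    """Return the 1-based ordinal day within the year for a 'YYYY-MM-DD' string."""
    return date.fromisoformat(iso_date).timetuple().tm_yday


# Assumed pairing of the eventDate and startDayOfYear sample rows.
samples = {
    "1976-03-03": 63,
    "1984-05-15": 136,
    "1964-03-15": 75,
    "1883-08-31": 243,
    "1909-03-02": 61,
}

# Collect any record whose reported day of year differs from the computed one.
mismatches = {
    d: (day_of_year(d), reported)
    for d, reported in samples.items()
    if day_of_year(d) != reported
}
print(mismatches)  # {} for the five sample rows
```

The same comparison, applied to the full columns, would flag rows where the two fields were populated inconsistently.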
endDayOfYear
Text
Missing 
| Distinct | 368 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 842311 |
| Missing (%) | 43.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 3 |
| Mean length | 2.738167408 |
| Min length | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 63 |
|---|---|
| 2nd row | 136 |
| 3rd row | 75 |
| 4th row | 243 |
| 5th row | 61 |
| Value | Count | Frequency (%) |
| 202 | 9184 | 0.8% |
| 133 | 9037 | 0.8% |
| 187 | 8347 | 0.8% |
| 41 | 7969 | 0.7% |
| 323 | 7925 | 0.7% |
| 130 | 7869 | 0.7% |
| 153 | 7303 | 0.7% |
| 313 | 6544 | 0.6% |
| 191 | 6380 | 0.6% |
| 44 | 6227 | 0.6% |
| Other values (360) | 1007299 | 92.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 585916 | 19.7% |
| 2 | 551639 | 18.6% |
| 3 | 416638 | 14.0% |
| 4 | 236593 | 8.0% |
| 5 | 218340 | 7.4% |
| 0 | 208775 | 7.0% |
| 6 | 195661 | 6.6% |
| 9 | 190870 | 6.4% |
| 7 | 187463 | 6.3% |
| 8 | 176487 | 5.9% |
| Other values (10) | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2968382 | > 99.9% |
| Lowercase Letter | 10 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 585916 | 19.7% |
| 2 | 551639 | 18.6% |
| 3 | 416638 | 14.0% |
| 4 | 236593 | 8.0% |
| 5 | 218340 | 7.4% |
| 0 | 208775 | 7.0% |
| 6 | 195661 | 6.6% |
| 9 | 190870 | 6.4% |
| 7 | 187463 | 6.3% |
| 8 | 176487 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | 40.0% |
| e | 2 | 20.0% |
| z | 1 | 10.0% |
| g | 1 | 10.0% |
| l | 1 | 10.0% |
| k | 1 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2 | 50.0% |
| P | 1 | 25.0% |
| E | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2968384 | > 99.9% |
| Latin | 14 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 585916 | 19.7% |
| 2 | 551639 | 18.6% |
| 3 | 416638 | 14.0% |
| 4 | 236593 | 8.0% |
| 5 | 218340 | 7.4% |
| 0 | 208775 | 7.0% |
| 6 | 195661 | 6.6% |
| 9 | 190870 | 6.4% |
| 7 | 187463 | 6.3% |
| 8 | 176487 | 5.9% |
Latin
| Value | Count | Frequency (%) |
| a | 4 | 28.6% |
| e | 2 | 14.3% |
| L | 2 | 14.3% |
| P | 1 | 7.1% |
| z | 1 | 7.1% |
| E | 1 | 7.1% |
| g | 1 | 7.1% |
| l | 1 | 7.1% |
| k | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2968398 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 585916 | 19.7% |
| 2 | 551639 | 18.6% |
| 3 | 416638 | 14.0% |
| 4 | 236593 | 8.0% |
| 5 | 218340 | 7.4% |
| 0 | 208775 | 7.0% |
| 6 | 195661 | 6.6% |
| 9 | 190870 | 6.4% |
| 7 | 187463 | 6.3% |
| 8 | 176487 | 5.9% |
| Other values (10) | 16 | < 0.1% |
year
Text
Missing 
| Distinct | 207 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 689273 |
| Missing (%) | 35.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1976 |
|---|---|
| 2nd row | 1984 |
| 3rd row | 1964 |
| 4th row | 1883 |
| 5th row | 1909 |
| Value | Count | Frequency (%) |
| 1977 | 73835 | 6.0% |
| 1981 | 43749 | 3.5% |
| 1976 | 42199 | 3.4% |
| 1984 | 38196 | 3.1% |
| 1982 | 38145 | 3.1% |
| 1908 | 35299 | 2.9% |
| 1983 | 34031 | 2.8% |
| 1985 | 30482 | 2.5% |
| 1964 | 28236 | 2.3% |
| 1975 | 25013 | 2.0% |
| Other values (197) | 847935 | 68.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1361026 | 27.5% |
| 9 | 1244662 | 25.2% |
| 8 | 523421 | 10.6% |
| 7 | 428827 | 8.7% |
| 6 | 322906 | 6.5% |
| 0 | 305534 | 6.2% |
| 2 | 219318 | 4.4% |
| 5 | 194130 | 3.9% |
| 4 | 177446 | 3.6% |
| 3 | 171210 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4948480 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1361026 | 27.5% |
| 9 | 1244662 | 25.2% |
| 8 | 523421 | 10.6% |
| 7 | 428827 | 8.7% |
| 6 | 322906 | 6.5% |
| 0 | 305534 | 6.2% |
| 2 | 219318 | 4.4% |
| 5 | 194130 | 3.9% |
| 4 | 177446 | 3.6% |
| 3 | 171210 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4948480 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1361026 | 27.5% |
| 9 | 1244662 | 25.2% |
| 8 | 523421 | 10.6% |
| 7 | 428827 | 8.7% |
| 6 | 322906 | 6.5% |
| 0 | 305534 | 6.2% |
| 2 | 219318 | 4.4% |
| 5 | 194130 | 3.9% |
| 4 | 177446 | 3.6% |
| 3 | 171210 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4948480 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1361026 | 27.5% |
| 9 | 1244662 | 25.2% |
| 8 | 523421 | 10.6% |
| 7 | 428827 | 8.7% |
| 6 | 322906 | 6.5% |
| 0 | 305534 | 6.2% |
| 2 | 219318 | 4.4% |
| 5 | 194130 | 3.9% |
| 4 | 177446 | 3.6% |
| 3 | 171210 | 3.5% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 800939 |
| Missing (%) | 41.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.191259705 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 5 |
| 3rd row | 3 |
| 4th row | 8 |
| 5th row | 3 |
| Value | Count | Frequency (%) |
| 8 | 129894 | 11.5% |
| 5 | 124558 | 11.1% |
| 7 | 123176 | 10.9% |
| 6 | 104255 | 9.3% |
| 4 | 99639 | 8.9% |
| 11 | 96677 | 8.6% |
| 2 | 95459 | 8.5% |
| 3 | 89439 | 7.9% |
| 9 | 80447 | 7.1% |
| 10 | 66176 | 5.9% |
| Other values (2) | 115734 | 10.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 375264 | 28.0% |
| 2 | 147860 | 11.0% |
| 8 | 129894 | 9.7% |
| 5 | 124558 | 9.3% |
| 7 | 123176 | 9.2% |
| 6 | 104255 | 7.8% |
| 4 | 99639 | 7.4% |
| 3 | 89439 | 6.7% |
| 9 | 80447 | 6.0% |
| 0 | 66176 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1340708 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 375264 | |
| 2 | 147860 | 11.0% |
| 8 | 129894 | 9.7% |
| 5 | 124558 | 9.3% |
| 7 | 123176 | 9.2% |
| 6 | 104255 | 7.8% |
| 4 | 99639 | 7.4% |
| 3 | 89439 | 6.7% |
| 9 | 80447 | 6.0% |
| 0 | 66176 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1340708 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 375264 | |
| 2 | 147860 | 11.0% |
| 8 | 129894 | 9.7% |
| 5 | 124558 | 9.3% |
| 7 | 123176 | 9.2% |
| 6 | 104255 | 7.8% |
| 4 | 99639 | 7.4% |
| 3 | 89439 | 6.7% |
| 9 | 80447 | 6.0% |
| 0 | 66176 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1340708 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 375264 | |
| 2 | 147860 | 11.0% |
| 8 | 129894 | 9.7% |
| 5 | 124558 | 9.3% |
| 7 | 123176 | 9.2% |
| 6 | 104255 | 7.8% |
| 4 | 99639 | 7.4% |
| 3 | 89439 | 6.7% |
| 9 | 80447 | 6.0% |
| 0 | 66176 | 4.9% |
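The value table for month folds two of the twelve values (1 and 12) into "Other values (2)", but the character tables carry enough information to separate them. The sketch below uses only counts printed in this section: the character "2" can come only from the values 2 and 12, which fixes the count for 12, and the rest of the "Other values" bucket must then belong to 1. Variable names are illustrative, and the reasoning assumes the column holds nothing but the strings "1" through "12".

```python
# Counts copied from the month value table (string value -> number of rows).
value_counts = {
    "8": 129894, "5": 124558, "7": 123176, "6": 104255, "4": 99639,
    "11": 96677, "2": 95459, "3": 89439, "9": 80447, "10": 66176,
}
other_values_total = 115734  # "Other values (2)": months 1 and 12 combined
char_2_total = 147860        # occurrences of the character "2"
char_1_total = 375264        # occurrences of the character "1"
reported_total_chars = 1340708

# The character "2" appears once in "2" and once in "12", and nowhere else.
count_12 = char_2_total - value_counts["2"]
count_1 = other_values_total - count_12

full_counts = dict(value_counts)
full_counts["1"] = count_1
full_counts["12"] = count_12

# Cross-checks against figures the report prints independently.
ones = sum(value.count("1") * n for value, n in full_counts.items())
total_chars = sum(len(value) * n for value, n in full_counts.items())
rows = sum(full_counts.values())
mean_length = total_chars / rows

print(count_1, count_12)
print(ones == char_1_total, total_chars == reported_total_chars)
print(mean_length)
```

The recovered counts are 63333 rows for month 1 and 52401 rows for month 12. Both cross-checks agree with the printed character totals, and the recomputed mean length matches the reported 1.191259705.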
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 887053 |
| Missing (%) | 46.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.70051956 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 15 |
| 3rd row | 15 |
| 4th row | 31 |
| 5th row | 2 |
| Value | Count | Frequency (%) |
| 13 | 42864 | 4.1% |
| 10 | 42434 | 4.1% |
| 19 | 40651 | 3.9% |
| 6 | 39463 | 3.8% |
| 21 | 37986 | 3.7% |
| 9 | 37781 | 3.6% |
| 15 | 37214 | 3.6% |
| 18 | 36290 | 3.5% |
| 14 | 35493 | 3.4% |
| 16 | 35080 | 3.4% |
| Other values (21) | 654084 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 488527 | |
| 2 | 412832 | |
| 3 | 151754 | 8.6% |
| 9 | 105393 | 6.0% |
| 0 | 105122 | 5.9% |
| 5 | 103912 | 5.9% |
| 6 | 103662 | 5.9% |
| 8 | 100571 | 5.7% |
| 4 | 99991 | 5.7% |
| 7 | 95654 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1767418 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 488527 | |
| 2 | 412832 | |
| 3 | 151754 | 8.6% |
| 9 | 105393 | 6.0% |
| 0 | 105122 | 5.9% |
| 5 | 103912 | 5.9% |
| 6 | 103662 | 5.9% |
| 8 | 100571 | 5.7% |
| 4 | 99991 | 5.7% |
| 7 | 95654 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1767418 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 488527 | |
| 2 | 412832 | |
| 3 | 151754 | 8.6% |
| 9 | 105393 | 6.0% |
| 0 | 105122 | 5.9% |
| 5 | 103912 | 5.9% |
| 6 | 103662 | 5.9% |
| 8 | 100571 | 5.7% |
| 4 | 99991 | 5.7% |
| 7 | 95654 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1767418 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 488527 | |
| 2 | 412832 | |
| 3 | 151754 | 8.6% |
| 9 | 105393 | 6.0% |
| 0 | 105122 | 5.9% |
| 5 | 103912 | 5.9% |
| 6 | 103662 | 5.9% |
| 8 | 100571 | 5.7% |
| 4 | 99991 | 5.7% |
| 7 | 95654 | 5.4% |
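Several rows in the tables of this report show a count but an empty Frequency (%) cell. The missing figure can be recomputed as the count divided by the table total. This is a minimal sketch, assuming the report rounds to one decimal place; `frequency_pct` is a hypothetical helper, not part of any profiling library. The first call reproduces a percentage the report does print, which is consistent with that assumption.

```python
def frequency_pct(count: int, total: int) -> str:
    """Format count / total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


# Total number of characters in the day column ("Decimal Number" row).
total_chars = 1767418

print(frequency_pct(151754, total_chars))  # character "3", printed in the report as 8.6%
print(frequency_pct(488527, total_chars))  # character "1", blank in the report
print(frequency_pct(412832, total_chars))  # character "2", blank in the report
```

The helper does not reproduce the report's `< 0.1%` notation for very small shares; that case would need an extra branch.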
Missing 
| Distinct | 47776 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 1173199 |
| Missing (%) | 60.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 181 |
|---|---|
| Median length | 11 |
| Mean length | 11.01797943 |
| Min length | 1 |
Unique
| Unique | 15837 |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | -- --- ---- |
|---|---|
| 2nd row | 15 MAY 1984 |
| 3rd row | 15 MAR 1964 |
| 4th row | 03 MAR 1967 |
| 5th row | 31 AUG 1958 |
| Value | Count | Frequency (%) |
| (empty) | 275912 | 12.6% |
| may | 68627 | 3.1% |
| aug | 65853 | 3.0% |
| jul | 61532 | 2.8% |
| apr | 57935 | 2.6% |
| feb | 53288 | 2.4% |
| jun | 52783 | 2.4% |
| nov | 52211 | 2.4% |
| mar | 46122 | 2.1% |
| 1977 | 42132 | 1.9% |
| Other values (8403) | 1419007 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1442208 | 17.4% |
| 1 | 1077550 | |
| 9 | 807908 | 9.7% |
| - | 749611 | 9.0% |
| 2 | 340282 | 4.1% |
| 7 | 334273 | 4.0% |
| 0 | 322856 | 3.9% |
| 8 | 301958 | 3.6% |
| 6 | 296090 | 3.6% |
| A | 274119 | 3.3% |
| Other values (71) | 2351821 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4021101 | |
| Uppercase Letter | 1821588 | |
| Space Separator | 1442208 | 17.4% |
| Dash Punctuation | 749611 | 9.0% |
| Lowercase Letter | 202119 | 2.4% |
| Other Punctuation | 58056 | 0.7% |
| Close Punctuation | 1860 | < 0.1% |
| Open Punctuation | 1857 | < 0.1% |
| Connector Punctuation | 187 | < 0.1% |
| Math Symbol | 89 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 23085 | |
| r | 22928 | |
| l | 19199 | |
| n | 18909 | |
| i | 17653 | |
| a | 15944 | |
| t | 14270 | |
| p | 13298 | 6.6% |
| g | 11952 | 5.9% |
| u | 11197 | 5.5% |
| Other values (15) | 33684 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 274119 | |
| U | 176026 | 9.7% |
| J | 155789 | 8.6% |
| N | 143298 | 7.9% |
| M | 120575 | 6.6% |
| E | 116136 | 6.4% |
| R | 101244 | 5.6% |
| P | 93222 | 5.1% |
| O | 88378 | 4.9% |
| Y | 68082 | 3.7% |
| Other values (14) | 484719 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19795 | |
| / | 15745 | |
| , | 11253 | |
| : | 9409 | |
| ; | 983 | 1.7% |
| ? | 319 | 0.5% |
| & | 294 | 0.5% |
| ' | 244 | 0.4% |
| " | 9 | < 0.1% |
| \ | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1077550 | |
| 9 | 807908 | |
| 2 | 340282 | 8.5% |
| 7 | 334273 | 8.3% |
| 0 | 322856 | 8.0% |
| 8 | 301958 | 7.5% |
| 6 | 296090 | 7.4% |
| 3 | 203568 | 5.1% |
| 5 | 177563 | 4.4% |
| 4 | 159053 | 4.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 80 | |
| ~ | 8 | 9.0% |
| < | 1 | 1.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1838 | |
| ] | 22 | 1.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1837 | |
| [ | 20 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1442208 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 749611 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 187 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6274969 | |
| Latin | 2023707 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 274119 | 13.5% |
| U | 176026 | 8.7% |
| J | 155789 | 7.7% |
| N | 143298 | 7.1% |
| M | 120575 | 6.0% |
| E | 116136 | 5.7% |
| R | 101244 | 5.0% |
| P | 93222 | 4.6% |
| O | 88378 | 4.4% |
| Y | 68082 | 3.4% |
| Other values (39) | 686838 |
Common
| Value | Count | Frequency (%) |
| (space) | 1442208 | 23.0% |
| 1 | 1077550 | |
| 9 | 807908 | |
| - | 749611 | |
| 2 | 340282 | 5.4% |
| 7 | 334273 | 5.3% |
| 0 | 322856 | 5.1% |
| 8 | 301958 | 4.8% |
| 6 | 296090 | 4.7% |
| 3 | 203568 | 3.2% |
| Other values (22) | 398665 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8298676 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1442208 | 17.4% |
| 1 | 1077550 | |
| 9 | 807908 | 9.7% |
| - | 749611 | 9.0% |
| 2 | 340282 | 4.1% |
| 7 | 334273 | 4.0% |
| 0 | 322856 | 3.9% |
| 8 | 301958 | 3.6% |
| 6 | 296090 | 3.6% |
| A | 274119 | 3.3% |
| Other values (71) | 2351821 |
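The sample rows in this section follow a `DD MON YYYY` layout, with dashes standing in for unknown parts (`-- --- ----`). The sketch below parses that layout in Python. The pattern is inferred from the five sample rows only; the maximum length of 181 and the presence of lowercase letters and punctuation indicate that some values are free text, and for those the hypothetical `parse_verbatim_date` helper returns `None` rather than guessing.

```python
import re
from typing import Optional, Tuple

MONTHS = {
    name: number
    for number, name in enumerate(
        ["JAN", "FEB", "MAR", "APR", "MAY", "JUN",
         "JUL", "AUG", "SEP", "OCT", "NOV", "DEC"],
        start=1,
    )
}

# Two-digit day, three-letter month, four-digit year; dashes mark unknown parts.
PATTERN = re.compile(r"^(\d{2}|--) ([A-Za-z]{3}|---) (\d{4}|----)$")

ParsedDate = Tuple[Optional[int], Optional[int], Optional[int]]


def parse_verbatim_date(text: str) -> Optional[ParsedDate]:
    """Return (year, month, day), using None for unknown parts.

    Returns None when the text does not follow the assumed layout.
    """
    match = PATTERN.match(text.strip())
    if match is None:
        return None
    day_text, month_text, year_text = match.groups()
    day = None if day_text == "--" else int(day_text)
    year = None if year_text == "----" else int(year_text)
    if month_text == "---":
        month = None
    else:
        month = MONTHS.get(month_text.upper())
        if month is None:  # three letters that are not a month abbreviation
            return None
    return (year, month, day)


for sample in ["-- --- ----", "15 MAY 1984", "03 MAR 1967", "circa 1900"]:
    print(repr(sample), parse_verbatim_date(sample))
```

A dictionary lookup is used instead of `datetime.strptime` so that month matching does not depend on the process locale and partially unknown dates can still be represented.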
habitat
Text
Missing 
| Distinct | 18961 |
|---|---|
| Distinct (%) | 27.4% |
| Missing | 1857136 |
| Missing (%) | 96.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 235 |
|---|---|
| Median length | 159 |
| Mean length | 19.79818646 |
| Min length | 1 |
Unique
| Unique | 13600 |
|---|---|
| Unique (%) | 19.6% |
Sample
| 1st row | Beach with fresh water creek running into it |
|---|---|
| 2nd row | Freshwater |
| 3rd row | In sand |
| 4th row | Mangrove |
| 5th row | Under rocks |
| Value | Count | Frequency (%) |
| freshwater | 9208 | 4.1% |
| in | 6886 | 3.1% |
| on | 6374 | 2.8% |
| reef | 6192 | 2.8% |
| sand | 6092 | 2.7% |
| coral | 5812 | 2.6% |
| of | 4886 | 2.2% |
| rocks | 4639 | 2.1% |
| sp | 4290 | 1.9% |
| intertidal | 4238 | 1.9% |
| Other values (6965) | 165798 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 155158 | 11.3% |
| e | 134098 | 9.8% |
| a | 117967 | 8.6% |
| r | 101199 | 7.4% |
| n | 83052 | 6.1% |
| s | 82888 | 6.0% |
| o | 79802 | 5.8% |
| t | 71848 | 5.2% |
| i | 60753 | 4.4% |
| l | 60225 | 4.4% |
| Other values (79) | 424173 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1121766 | |
| Space Separator | 155158 | 11.3% |
| Uppercase Letter | 60796 | 4.4% |
| Other Punctuation | 20726 | 1.5% |
| Decimal Number | 6945 | 0.5% |
| Math Symbol | 2493 | 0.2% |
| Dash Punctuation | 1845 | 0.1% |
| Open Punctuation | 719 | 0.1% |
| Close Punctuation | 714 | 0.1% |
| Other Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 134098 | |
| a | 117967 | |
| r | 101199 | 9.0% |
| n | 83052 | 7.4% |
| s | 82888 | 7.4% |
| o | 79802 | 7.1% |
| t | 71848 | 6.4% |
| i | 60753 | 5.4% |
| l | 60225 | 5.4% |
| d | 54663 | 4.9% |
| Other values (18) | 275271 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 12119 | |
| L | 6196 | |
| S | 6181 | |
| I | 5575 | |
| R | 4363 | 7.2% |
| O | 3937 | 6.5% |
| M | 3425 | 5.6% |
| C | 3204 | 5.3% |
| U | 2435 | 4.0% |
| B | 2327 | 3.8% |
| Other values (16) | 11034 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10235 | |
| . | 7694 | |
| ; | 838 | 4.0% |
| / | 686 | 3.3% |
| ' | 442 | 2.1% |
| # | 299 | 1.4% |
| & | 196 | 0.9% |
| : | 111 | 0.5% |
| % | 90 | 0.4% |
| " | 75 | 0.4% |
| Other values (3) | 60 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1217 | |
| 0 | 1157 | |
| 2 | 887 | |
| 5 | 750 | |
| 3 | 666 | |
| 4 | 598 | |
| 6 | 523 | |
| 8 | 390 | 5.6% |
| 7 | 387 | 5.6% |
| 9 | 370 | 5.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2456 | |
| = | 24 | 1.0% |
| < | 7 | 0.3% |
| ~ | 4 | 0.2% |
| > | 2 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 715 | |
| [ | 4 | 0.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 711 | |
| ] | 3 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 155158 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1845 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1182562 | |
| Common | 188601 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 134098 | 11.3% |
| a | 117967 | 10.0% |
| r | 101199 | 8.6% |
| n | 83052 | 7.0% |
| s | 82888 | 7.0% |
| o | 79802 | 6.7% |
| t | 71848 | 6.1% |
| i | 60753 | 5.1% |
| l | 60225 | 5.1% |
| d | 54663 | 4.6% |
| Other values (44) | 336067 |
Common
| Value | Count | Frequency (%) |
| (space) | 155158 | 82.3% |
| , | 10235 | 5.4% |
| . | 7694 | 4.1% |
| + | 2456 | 1.3% |
| - | 1845 | 1.0% |
| 1 | 1217 | 0.6% |
| 0 | 1157 | 0.6% |
| 2 | 887 | 0.5% |
| ; | 838 | 0.4% |
| 5 | 750 | 0.4% |
| Other values (25) | 6364 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1371160 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 155158 | 11.3% |
| e | 134098 | 9.8% |
| a | 117967 | 8.6% |
| r | 101199 | 7.4% |
| n | 83052 | 6.1% |
| s | 82888 | 6.0% |
| o | 79802 | 5.8% |
| t | 71848 | 5.2% |
| i | 60753 | 4.4% |
| l | 60225 | 4.4% |
| Other values (76) | 424170 |
None
| Value | Count | Frequency (%) |
| é | 1 | |
| ° | 1 | |
| ç | 1 |
samplingEffort
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 24.1667 |
|---|
| Value | Count | Frequency (%) |
| 24.1667 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| . | 1 | |
| 1 | 1 | |
| 7 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Other Punctuation | 1 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 7 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| . | 1 | |
| 1 | 1 | |
| 7 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| . | 1 | |
| 1 | 1 | |
| 7 | 1 |
fieldNotes
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -110.283 |
|---|
| Value | Count | Frequency (%) |
| 110.283 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| - | 1 | |
| 0 | 1 | |
| . | 1 | |
| 2 | 1 | |
| 8 | 1 | |
| 3 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Dash Punctuation | 1 | 12.5% |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 1 | |
| 2 | 1 | |
| 8 | 1 | |
| 3 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| - | 1 | |
| 0 | 1 | |
| . | 1 | |
| 2 | 1 | |
| 8 | 1 | |
| 3 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| - | 1 | |
| 0 | 1 | |
| . | 1 | |
| 2 | 1 | |
| 8 | 1 | |
| 3 | 1 |
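The category labels in these tables (Decimal Number, Dash Punctuation, Other Punctuation, and so on) match the names of Unicode general categories. The sketch below assumes the report classifies each character that way, which is an inference from the labels rather than a statement about the profiler's internals, and reproduces the category tables for the two single-value columns above. The `CATEGORY_NAMES` mapping and `category_counts` helper are illustrative and cover only the categories that appear in this section.

```python
import unicodedata
from collections import Counter

# Unicode general category codes mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "No": "Other Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pc": "Connector Punctuation",
    "Sm": "Math Symbol",
    "So": "Other Symbol",
    "Cc": "Control",
}


def category_counts(text: str) -> Counter:
    """Count characters per Unicode general category, using report labels."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


for sample in ("24.1667", "-110.283"):
    counts = category_counts(sample)
    total = sum(counts.values())
    for name, n in counts.most_common():
        print(f"{sample!r}: {name} = {n} ({100 * n / total:.1f}%)")
```

For `24.1667` this gives six Decimal Number characters and one Other Punctuation character (14.3%), and for `-110.283` six Decimal Number characters plus one Dash Punctuation and one Other Punctuation character (12.5% each), in line with the tables above.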
locationID
Text
Missing 
| Distinct | 94703 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 984066 |
| Missing (%) | 51.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 37768 |
|---|---|
| Median length | 134 |
| Mean length | 4.4719158 |
| Min length | 1 |
Unique
| Unique | 52904 |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | E4 |
|---|---|
| 2nd row | NR 12-4 ID 101 |
| 3rd row | 23 |
| 4th row | 1002 |
| 5th row | 2059 |
| Value | Count | Frequency (%) |
| not | 12392 | 1.2% |
| rec | 12070 | 1.2% |
| 4 | 8476 | 0.8% |
| rhb | 7696 | 0.7% |
| rfb | 7623 | 0.7% |
| 1 | 7614 | 0.7% |
| 2 | 6232 | 0.6% |
| 3 | 5496 | 0.5% |
| gs | 5168 | 0.5% |
| 6 | 5011 | 0.5% |
| Other values (80921) | 965661 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 474584 | 11.3% |
| 2 | 394541 | 9.4% |
| 0 | 331952 | 7.9% |
| 5 | 296061 | 7.0% |
| 3 | 287737 | 6.8% |
| 4 | 264333 | 6.3% |
| - | 262376 | 6.2% |
| 6 | 216672 | 5.1% |
| 7 | 190959 | 4.5% |
| 8 | 180969 | 4.3% |
| Other values (85) | 1313823 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2803373 | |
| Uppercase Letter | 884393 | 21.0% |
| Dash Punctuation | 262384 | 6.2% |
| Space Separator | 99106 | 2.4% |
| Other Punctuation | 75953 | 1.8% |
| Lowercase Letter | 66356 | 1.6% |
| Connector Punctuation | 8295 | 0.2% |
| Control | 6660 | 0.2% |
| Close Punctuation | 3380 | 0.1% |
| Open Punctuation | 3373 | 0.1% |
| Other values (2) | 734 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9590 | |
| o | 7889 | |
| r | 7537 | |
| a | 7245 | |
| i | 4117 | 6.2% |
| t | 3885 | 5.9% |
| l | 3658 | 5.5% |
| n | 2898 | 4.4% |
| c | 2771 | 4.2% |
| s | 2638 | 4.0% |
| Other values (18) | 14128 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 92320 | 10.4% |
| S | 79255 | 9.0% |
| C | 72043 | 8.1% |
| B | 66965 | 7.6% |
| R | 60474 | 6.8% |
| M | 57012 | 6.4% |
| N | 52437 | 5.9% |
| E | 48652 | 5.5% |
| I | 45116 | 5.1% |
| T | 37245 | 4.2% |
| Other values (17) | 272874 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37486 | |
| . | 24846 | |
| , | 7340 | 9.7% |
| / | 3833 | 5.0% |
| # | 1569 | 2.1% |
| & | 288 | 0.4% |
| ; | 175 | 0.2% |
| ? | 147 | 0.2% |
| * | 124 | 0.2% |
| ' | 117 | 0.2% |
| Other values (4) | 28 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 474584 | |
| 2 | 394541 | |
| 0 | 331952 | |
| 5 | 296061 | |
| 3 | 287737 | |
| 4 | 264333 | |
| 6 | 216672 | |
| 7 | 190959 | |
| 8 | 180969 | 6.5% |
| 9 | 165565 | 5.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3091 | |
| ] | 288 | 8.5% |
| } | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3084 | |
| [ | 288 | 8.5% |
| { | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 262376 | |
| – | 8 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 6630 | 99.5% |
| (control character) | 30 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 724 | |
| = | 8 | 1.1% |
Other Number
| Value | Count | Frequency (%) |
| ₂ | 1 | |
| ₁ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 99106 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8295 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3263258 | |
| Latin | 950749 | 22.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 92320 | 9.7% |
| S | 79255 | 8.3% |
| C | 72043 | 7.6% |
| B | 66965 | 7.0% |
| R | 60474 | 6.4% |
| M | 57012 | 6.0% |
| N | 52437 | 5.5% |
| E | 48652 | 5.1% |
| I | 45116 | 4.7% |
| T | 37245 | 3.9% |
| Other values (45) | 339230 |
Common
| Value | Count | Frequency (%) |
| 1 | 474584 | |
| 2 | 394541 | |
| 0 | 331952 | |
| 5 | 296061 | |
| 3 | 287737 | |
| 4 | 264333 | |
| - | 262376 | |
| 6 | 216672 | |
| 7 | 190959 | |
| 8 | 180969 | 5.5% |
| Other values (30) | 363074 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4213988 | |
| None | 11 | < 0.1% |
| Punctuation | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 474584 | 11.3% |
| 2 | 394541 | 9.4% |
| 0 | 331952 | 7.9% |
| 5 | 296061 | 7.0% |
| 3 | 287737 | 6.8% |
| 4 | 264333 | 6.3% |
| - | 262376 | 6.2% |
| 6 | 216672 | 5.1% |
| 7 | 190959 | 4.5% |
| 8 | 180969 | 4.3% |
| Other values (79) | 1313804 |
Punctuation
| Value | Count | Frequency (%) |
| – | 8 |
None
| Value | Count | Frequency (%) |
| ö | 6 | |
| é | 2 | 18.2% |
| ₂ | 1 | 9.1% |
| É | 1 | 9.1% |
| ₁ | 1 | 9.1% |
higherGeography
Text
Missing 
| Distinct | 12370 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 67831 |
| Missing (%) | 3.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 126 |
|---|---|
| Median length | 104 |
| Mean length | 36.17342494 |
| Min length | 4 |
Unique
| Unique | 3190 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Atlantic Ocean, United States |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico, United States, Florida |
| 3rd row | North Atlantic Ocean, Caribbean Sea, Barbados |
| 4th row | North Atlantic Ocean, Gulf of Mexico, United States, Florida |
| 5th row | Philippines |
| Value | Count | Frequency (%) |
| ocean | 1259909 | 13.4% |
| north | 1098149 | 11.7% |
| united | 886190 | 9.4% |
| states | 871608 | 9.3% |
| atlantic | 718309 | 7.7% |
| pacific | 437003 | 4.7% |
| mexico | 248368 | 2.6% |
| of | 243369 | 2.6% |
| gulf | 228771 | 2.4% |
| south | 203325 | 2.2% |
| Other values (4652) | 3191450 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7527889 | 11.2% |
| a | 6865365 | 10.2% |
| t | 6256807 | 9.3% |
| i | 4780196 | 7.1% |
| e | 4733947 | 7.0% |
| n | 4584442 | 6.8% |
| c | 3760391 | 5.6% |
| o | 2897132 | 4.3% |
| , | 2857287 | 4.2% |
| r | 2272065 | 3.4% |
| Other values (67) | 20695032 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47723213 | |
| Uppercase Letter | 9110940 | 13.6% |
| Space Separator | 7527889 | 11.2% |
| Other Punctuation | 2867453 | 4.3% |
| Dash Punctuation | 1038 | < 0.1% |
| Open Punctuation | 10 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6865365 | |
| t | 6256807 | |
| i | 4780196 | |
| e | 4733947 | |
| n | 4584442 | |
| c | 3760391 | |
| o | 2897132 | 6.1% |
| r | 2272065 | 4.8% |
| s | 2141009 | 4.5% |
| l | 1955194 | 4.1% |
| Other values (28) | 7476665 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1397107 | |
| O | 1301413 | |
| N | 1193009 | |
| A | 1063860 | |
| U | 893936 | |
| P | 682206 | |
| C | 555491 | 6.1% |
| M | 514537 | 5.6% |
| G | 305981 | 3.4% |
| F | 216163 | 2.4% |
| Other values (17) | 987237 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2857287 | |
| . | 7750 | 0.3% |
| ' | 2246 | 0.1% |
| ? | 153 | < 0.1% |
| & | 11 | < 0.1% |
| / | 6 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 | |
| [ | 2 | 20.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 | |
| ] | 2 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7527889 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1038 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56834153 | |
| Common | 10396400 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6865365 | |
| t | 6256807 | 11.0% |
| i | 4780196 | 8.4% |
| e | 4733947 | 8.3% |
| n | 4584442 | 8.1% |
| c | 3760391 | 6.6% |
| o | 2897132 | 5.1% |
| r | 2272065 | 4.0% |
| s | 2141009 | 3.8% |
| l | 1955194 | 3.4% |
| Other values (55) | 16587605 |
Common
| Value | Count | Frequency (%) |
| (space) | 7527889 | 72.4% |
| , | 2857287 | 27.5% |
| . | 7750 | 0.1% |
| ' | 2246 | < 0.1% |
| - | 1038 | < 0.1% |
| ? | 153 | < 0.1% |
| & | 11 | < 0.1% |
| ( | 8 | < 0.1% |
| ) | 8 | < 0.1% |
| / | 6 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67229602 | |
| None | 951 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 7527889 | 11.2% |
| a | 6865365 | 10.2% |
| t | 6256807 | 9.3% |
| i | 4780196 | 7.1% |
| e | 4733947 | 7.0% |
| n | 4584442 | 6.8% |
| c | 3760391 | 5.6% |
| o | 2897132 | 4.3% |
| , | 2857287 | 4.3% |
| r | 2272065 | 3.4% |
| Other values (54) | 20694081 |
None
| Value | Count | Frequency (%) |
| ç | 434 | |
| í | 144 | 15.1% |
| é | 141 | 14.8% |
| ó | 110 | 11.6% |
| á | 100 | 10.5% |
| ê | 7 | 0.7% |
| è | 6 | 0.6% |
| ô | 3 | 0.3% |
| ü | 2 | 0.2% |
| ñ | 1 | 0.1% |
| Other values (3) | 3 | 0.3% |
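The higherGeography samples read as comma-separated hierarchies running from broad to narrow (for example ocean, gulf, country, state). The sketch below splits a value into its levels and estimates the average depth from the counts printed in this section. It assumes every comma is a level separator, which the report does not confirm; `split_higher_geography` is a hypothetical helper.

```python
from typing import List


def split_higher_geography(value: str) -> List[str]:
    """Split a comma-separated geography string into trimmed, non-empty levels."""
    return [part.strip() for part in value.split(",") if part.strip()]


sample = "North Atlantic Ocean, Gulf of Mexico, United States, Florida"
print(split_higher_geography(sample))

# Rough average depth, using counts from this section:
# total rows minus missing rows gives the populated rows,
# and each comma is assumed to separate two levels.
populated_rows = 1926393 - 67831
comma_count = 2857287
average_levels = comma_count / populated_rows + 1
print(round(average_levels, 2))
```

Under that assumption a populated value carries roughly two and a half levels on average, which fits a mix of short entries such as "Philippines" and four-level entries such as the Florida sample.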
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1027391 |
| Missing (%) | 53.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 9.980899931 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | ASIA |
| 3rd row | NORTH_AMERICA |
| 4th row | OCEANIA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 475004 | |
| oceania | 155883 | 17.3% |
| asia | 135716 | 15.1% |
| south_america | 44254 | 4.9% |
| africa | 39371 | 4.4% |
| europe | 33879 | 3.8% |
| antarctica | 14895 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1745141 | |
| R | 1082407 | |
| I | 865123 | |
| C | 744302 | |
| E | 742899 | |
| O | 709020 | |
| N | 645782 | 7.2% |
| T | 549048 | 6.1% |
| H | 519258 | 5.8% |
| _ | 519258 | 5.8% |
| Other values (5) | 850611 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8453591 | |
| Connector Punctuation | 519258 | 5.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1745141 | |
| R | 1082407 | |
| I | 865123 | |
| C | 744302 | |
| E | 742899 | |
| O | 709020 | |
| N | 645782 | 7.6% |
| T | 549048 | 6.5% |
| H | 519258 | 6.1% |
| M | 519258 | 6.1% |
| Other values (4) | 331353 | 3.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 519258 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8453591 | |
| Common | 519258 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1745141 | |
| R | 1082407 | |
| I | 865123 | |
| C | 744302 | |
| E | 742899 | |
| O | 709020 | |
| N | 645782 | 7.6% |
| T | 549048 | 6.5% |
| H | 519258 | 6.1% |
| M | 519258 | 6.1% |
| Other values (4) | 331353 | 3.9% |
Common
| Value | Count | Frequency (%) |
| _ | 519258 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8972849 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1745141 | |
| R | 1082407 | |
| I | 865123 | |
| C | 744302 | |
| E | 742899 | |
| O | 709020 | |
| N | 645782 | 7.2% |
| T | 549048 | 6.1% |
| H | 519258 | 5.8% |
| _ | 519258 | 5.8% |
| Other values (5) | 850611 |
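Because continent has only seven distinct values, the character-level figures in this section can be recomputed from the value table alone. The sketch below does so in Python as a consistency check; it uses the uppercase spellings shown in the sample rows (the value table prints them lowercased) and only the counts printed above.

```python
# Value counts from the continent table, keyed by the spelling in the samples.
value_counts = {
    "NORTH_AMERICA": 475004,
    "OCEANIA": 155883,
    "ASIA": 135716,
    "SOUTH_AMERICA": 44254,
    "AFRICA": 39371,
    "EUROPE": 33879,
    "ANTARCTICA": 14895,
}

rows = sum(value_counts.values())
total_chars = sum(len(name) * n for name, n in value_counts.items())
underscores = sum(name.count("_") * n for name, n in value_counts.items())
letter_a = sum(name.count("A") * n for name, n in value_counts.items())
mean_length = total_chars / rows
north_america_share = 100 * value_counts["NORTH_AMERICA"] / rows

print(rows)                             # populated rows: 1926393 - 1027391
print(total_chars)                      # compare with the ASCII block total
print(underscores, letter_a)            # compare with the "_" and "A" rows
print(mean_length)                      # compare with the reported mean length
print(f"{north_america_share:.1f}%")    # blank in the value table
```

The recomputed totals agree with the report: 899002 populated rows, 8972849 characters, 519258 underscores, 1745141 occurrences of "A", and a mean length of about 9.9809. The share for north_america, left blank in the value table, comes out at 52.8%.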
waterBody
Text
Missing 
| Distinct | 1655 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 666651 |
| Missing (%) | 34.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 76 |
|---|---|
| Median length | 75 |
| Mean length | 24.49184833 |
| Min length | 7 |
Unique
| Unique | 510 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico |
| 3rd row | North Atlantic Ocean, Caribbean Sea |
| 4th row | North Atlantic Ocean, Gulf of Mexico |
| 5th row | Antarctic Ocean |
| Value | Count | Frequency (%) |
| ocean | 1259434 | |
| north | 998553 | |
| atlantic | 718247 | |
| pacific | 436962 | 9.1% |
| of | 231313 | 4.8% |
| gulf | 228638 | 4.7% |
| sea | 193896 | 4.0% |
| mexico | 187756 | 3.9% |
| south | 160377 | 3.3% |
| caribbean | 89358 | 1.9% |
| Other values (1319) | 318010 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3562802 | 11.5% |
| c | 3175906 | |
| a | 3113538 | 10.1% |
| t | 2738941 | 8.9% |
| n | 2331622 | 7.6% |
| i | 2082746 | 6.8% |
| e | 1823700 | 5.9% |
| o | 1648330 | 5.3% |
| O | 1261125 | 4.1% |
| r | 1218140 | 3.9% |
| Other values (53) | 7896560 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22247025 | |
| Uppercase Letter | 4591399 | 14.9% |
| Space Separator | 3562802 | 11.5% |
| Other Punctuation | 451904 | 1.5% |
| Dash Punctuation | 276 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 3175906 | |
| a | 3113538 | |
| t | 2738941 | |
| n | 2331622 | |
| i | 2082746 | |
| e | 1823700 | |
| o | 1648330 | |
| r | 1218140 | 5.5% |
| h | 1180475 | 5.3% |
| l | 988529 | 4.4% |
| Other values (20) | 1945098 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 1261125 | |
| N | 1000300 | |
| A | 784279 | |
| P | 450444 | 9.8% |
| S | 386552 | 8.4% |
| G | 231863 | 5.0% |
| M | 210774 | 4.6% |
| C | 120701 | 2.6% |
| B | 53751 | 1.2% |
| I | 51181 | 1.1% |
| Other values (15) | 40429 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 451309 | |
| . | 465 | 0.1% |
| ' | 117 | < 0.1% |
| ? | 13 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3562802 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 276 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26838424 | |
| Common | 4014986 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 3175906 | |
| a | 3113538 | |
| t | 2738941 | |
| n | 2331622 | 8.7% |
| i | 2082746 | 7.8% |
| e | 1823700 | 6.8% |
| o | 1648330 | 6.1% |
| O | 1261125 | 4.7% |
| r | 1218140 | 4.5% |
| h | 1180475 | 4.4% |
| Other values (45) | 6263901 |
Common
| Value | Count | Frequency (%) |
| (space) | 3562802 | 88.7% |
| , | 451309 | 11.2% |
| . | 465 | < 0.1% |
| - | 276 | < 0.1% |
| ' | 117 | < 0.1% |
| ? | 13 | < 0.1% |
| [ | 2 | < 0.1% |
| ] | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30853307 | |
| None | 103 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3562802 | 11.5% |
| c | 3175906 | |
| a | 3113538 | 10.1% |
| t | 2738941 | 8.9% |
| n | 2331622 | 7.6% |
| i | 2082746 | 6.8% |
| e | 1823700 | 5.9% |
| o | 1648330 | 5.3% |
| O | 1261125 | 4.1% |
| r | 1218140 | 3.9% |
| Other values (49) | 7896457 |
None
| Value | Count | Frequency (%) |
| í | 48 | |
| á | 46 | |
| ó | 6 | 5.8% |
| è | 3 | 2.9% |
islandGroup
Text
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 1925623 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 14.52857143 |
| Min length | 5 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Society Islands |
|---|---|
| 2nd row | Society Islands |
| 3rd row | Society Islands |
| 4th row | Society Islands |
| 5th row | Society Islands |
| Value | Count | Frequency (%) |
| islands | 707 | |
| society | 679 | |
| exuma | 20 | 1.3% |
| south | 12 | 0.8% |
| sandwich | 12 | 0.8% |
| florida | 10 | 0.7% |
| keys | 10 | 0.7% |
| pacific | 10 | 0.7% |
| carolina | 8 | 0.5% |
| aleutian | 7 | 0.5% |
| Other values (14) | 28 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | 7.2% |
| l | 751 | 6.7% |
| n | 748 | 6.7% |
| i | 743 | 6.6% |
| d | 738 | 6.6% |
| (space) | 733 | 6.6% |
| o | 722 | 6.5% |
| c | 713 | 6.4% |
| e | 711 | 6.4% |
| Other values (25) | 3079 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8951 | |
| Uppercase Letter | 1503 | 13.4% |
| Space Separator | 733 | 6.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | |
| l | 751 | |
| n | 748 | |
| i | 743 | |
| d | 738 | |
| o | 722 | |
| c | 713 | |
| e | 711 | |
| t | 699 | |
| Other values (11) | 877 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 710 | |
| S | 703 | |
| E | 21 | 1.4% |
| C | 16 | 1.1% |
| P | 12 | 0.8% |
| F | 10 | 0.7% |
| K | 10 | 0.7% |
| A | 7 | 0.5% |
| M | 6 | 0.4% |
| R | 2 | 0.1% |
| Other values (3) | 6 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 733 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10454 | |
| Common | 733 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | 7.7% |
| l | 751 | 7.2% |
| n | 748 | 7.2% |
| i | 743 | 7.1% |
| d | 738 | 7.1% |
| o | 722 | 6.9% |
| c | 713 | 6.8% |
| e | 711 | 6.8% |
| I | 710 | 6.8% |
| Other values (24) | 2369 |
Common
| Value | Count | Frequency (%) |
| (space) | 733 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | 7.2% |
| l | 751 | 6.7% |
| n | 748 | 6.7% |
| i | 743 | 6.6% |
| d | 738 | 6.6% |
| (space) | 733 | 6.6% |
| o | 722 | 6.5% |
| c | 713 | 6.4% |
| e | 711 | 6.4% |
| Other values (25) | 3079 |
island
Text
Missing 
| Distinct | 58 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 1925415 |
| Missing (%) | 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 6 |
| Mean length | 6.676891616 |
| Min length | 4 |
Unique
| Unique | 33 |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | Moorea |
|---|---|
| 2nd row | Moorea |
| 3rd row | Shikoku |
| 4th row | Oahu |
| 5th row | Moorea |
| Value | Count | Frequency (%) |
| moorea | 674 | |
| oahu | 147 | 13.2% |
| island | 91 | 8.2% |
| great | 20 | 1.8% |
| exuma | 20 | 1.8% |
| nunivak | 13 | 1.2% |
| eniwetok | 13 | 1.2% |
| bonaire | 11 | 1.0% |
| key | 10 | 0.9% |
| west | 10 | 0.9% |
| Other values (58) | 106 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| M | 683 | |
| u | 225 | 3.4% |
| n | 186 | 2.8% |
| h | 170 | 2.6% |
| O | 154 | 2.4% |
| (space) | 137 | 2.1% |
| Other values (39) | 977 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5279 | |
| Uppercase Letter | 1113 | 17.0% |
| Space Separator | 137 | 2.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| u | 225 | 4.3% |
| n | 186 | 3.5% |
| h | 170 | 3.2% |
| s | 121 | 2.3% |
| d | 107 | 2.0% |
| l | 105 | 2.0% |
| Other values (16) | 367 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 683 | |
| O | 154 | 13.8% |
| I | 90 | 8.1% |
| E | 35 | 3.1% |
| G | 23 | 2.1% |
| K | 21 | 1.9% |
| N | 19 | 1.7% |
| S | 19 | 1.7% |
| B | 17 | 1.5% |
| R | 11 | 1.0% |
| Other values (11) | 41 | 3.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 137 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6392 | |
| Common | 138 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| M | 683 | |
| u | 225 | 3.5% |
| n | 186 | 2.9% |
| h | 170 | 2.7% |
| O | 154 | 2.4% |
| s | 121 | 1.9% |
| Other values (37) | 855 |
Common
| Value | Count | Frequency (%) |
| (space) | 137 | |
| . | 1 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6528 | |
| None | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| M | 683 | |
| u | 225 | 3.4% |
| n | 186 | 2.8% |
| h | 170 | 2.6% |
| O | 154 | 2.4% |
| (space) | 137 | 2.1% |
| Other values (38) | 975 |
None
| Value | Count | Frequency (%) |
| á | 2 |
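The category tables above are tallies over every character in the column's non-missing values. A minimal sketch of how such a tally might be reproduced with Python's standard `unicodedata` module follows. It assumes the report's labels correspond to Unicode general categories. The `CATEGORY_LABELS` mapping and `category_counts` function are illustrative, not the profiling tool's own code, and the last sample value is made up to exercise the space and punctuation cases.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from Unicode general-category codes to report labels.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(values):
    """Count characters per Unicode general category, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


# First values are taken from the island sample rows; "St. John" is invented.
sample = ["Moorea", "Moorea", "Shikoku", "Oahu", None, "St. John"]
counts = category_counts(sample)
for label, n in counts.most_common():
    print(label, n)
```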
countryCode
Text
Missing 
| Distinct | 239 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110759 |
| Missing (%) | 5.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | BB |
| 4th row | US |
| 5th row | PH |
| Value | Count | Frequency (%) |
| us | 868583 | |
| ph | 93802 | 5.2% |
| mx | 59371 | 3.3% |
| pa | 46369 | 2.6% |
| aq | 44802 | 2.5% |
| jp | 38538 | 2.1% |
| cu | 30147 | 1.7% |
| ca | 28674 | 1.6% |
| jm | 27586 | 1.5% |
| pf | 27226 | 1.5% |
| Other values (229) | 550536 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 948161 | |
| S | 926976 | |
| P | 250779 | 6.9% |
| A | 177982 | 4.9% |
| M | 160911 | 4.4% |
| H | 143259 | 3.9% |
| C | 133182 | 3.7% |
| B | 95322 | 2.6% |
| J | 78390 | 2.2% |
| G | 66596 | 1.8% |
| Other values (16) | 649710 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3631268 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 948161 | |
| S | 926976 | |
| P | 250779 | 6.9% |
| A | 177982 | 4.9% |
| M | 160911 | 4.4% |
| H | 143259 | 3.9% |
| C | 133182 | 3.7% |
| B | 95322 | 2.6% |
| J | 78390 | 2.2% |
| G | 66596 | 1.8% |
| Other values (16) | 649710 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3631268 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 948161 | |
| S | 926976 | |
| P | 250779 | 6.9% |
| A | 177982 | 4.9% |
| M | 160911 | 4.4% |
| H | 143259 | 3.9% |
| C | 133182 | 3.7% |
| B | 95322 | 2.6% |
| J | 78390 | 2.2% |
| G | 66596 | 1.8% |
| Other values (16) | 649710 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3631268 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 948161 | |
| S | 926976 | |
| P | 250779 | 6.9% |
| A | 177982 | 4.9% |
| M | 160911 | 4.4% |
| H | 143259 | 3.9% |
| C | 133182 | 3.7% |
| B | 95322 | 2.6% |
| J | 78390 | 2.2% |
| G | 66596 | 1.8% |
| Other values (16) | 649710 |
stateProvince
Text
Missing 
| Distinct | 1326 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 943673 |
| Missing (%) | 49.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 39 |
| Mean length | 9.182679705 |
| Min length | 3 |
Unique
| Unique | 281 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Florida |
| 3rd row | Massachusetts |
| 4th row | Quezon |
| 5th row | Newfoundland |
| Value | Count | Frequency (%) |
| florida | 157981 | 13.1% |
| massachusetts | 103383 | 8.6% |
| california | 57085 | 4.7% |
| carolina | 53929 | 4.5% |
| texas | 43591 | 3.6% |
| alaska | 41859 | 3.5% |
| north | 31994 | 2.7% |
| louisiana | 28645 | 2.4% |
| hawaii | 26401 | 2.2% |
| south | 26211 | 2.2% |
| Other values (1250) | 635019 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1427949 | |
| i | 809015 | 9.0% |
| s | 773254 | 8.6% |
| o | 650882 | 7.2% |
| r | 519439 | 5.8% |
| l | 506660 | 5.6% |
| n | 498668 | 5.5% |
| e | 457618 | 5.1% |
| t | 400633 | 4.4% |
| u | 277325 | 3.1% |
| Other values (60) | 2702560 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7611777 | |
| Uppercase Letter | 1183253 | 13.1% |
| Space Separator | 223378 | 2.5% |
| Other Punctuation | 5089 | 0.1% |
| Dash Punctuation | 489 | < 0.1% |
| Open Punctuation | 8 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1427949 | |
| i | 809015 | |
| s | 773254 | |
| o | 650882 | |
| r | 519439 | 6.8% |
| l | 506660 | 6.7% |
| n | 498668 | 6.6% |
| e | 457618 | 6.0% |
| t | 400633 | 5.3% |
| u | 277325 | 3.6% |
| Other values (24) | 1290334 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 171151 | |
| C | 165212 | |
| F | 164699 | |
| A | 80869 | 6.8% |
| N | 78781 | 6.7% |
| T | 76132 | 6.4% |
| S | 72421 | 6.1% |
| I | 44681 | 3.8% |
| G | 38393 | 3.2% |
| L | 36085 | 3.0% |
| Other values (17) | 254829 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4593 | |
| . | 302 | 5.9% |
| ' | 148 | 2.9% |
| ? | 46 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 223378 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 489 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8795030 | |
| Common | 228973 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1427949 | |
| i | 809015 | 9.2% |
| s | 773254 | 8.8% |
| o | 650882 | 7.4% |
| r | 519439 | 5.9% |
| l | 506660 | 5.8% |
| n | 498668 | 5.7% |
| e | 457618 | 5.2% |
| t | 400633 | 4.6% |
| u | 277325 | 3.2% |
| Other values (51) | 2473587 |
Common
| Value | Count | Frequency (%) |
| (space) | 223378 | |
| , | 4593 | 2.0% |
| - | 489 | 0.2% |
| . | 302 | 0.1% |
| ' | 148 | 0.1% |
| ? | 46 | < 0.1% |
| ( | 8 | < 0.1% |
| ) | 8 | < 0.1% |
| \| | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9023619 | |
| None | 384 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1427949 | |
| i | 809015 | 9.0% |
| s | 773254 | 8.6% |
| o | 650882 | 7.2% |
| r | 519439 | 5.8% |
| l | 506660 | 5.6% |
| n | 498668 | 5.5% |
| e | 457618 | 5.1% |
| t | 400633 | 4.4% |
| u | 277325 | 3.1% |
| Other values (51) | 2702176 |
None
| Value | Count | Frequency (%) |
| é | 123 | |
| ó | 101 | |
| í | 96 | |
| á | 52 | |
| ê | 7 | 1.8% |
| è | 2 | 0.5% |
| Ñ | 1 | 0.3% |
| ú | 1 | 0.3% |
| ô | 1 | 0.3% |
county
Text
Missing 
| Distinct | 2594 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 1786420 |
| Missing (%) | 92.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 43 |
| Mean length | 14.35974795 |
| Min length | 3 |
Unique
| Unique | 558 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Cumberland County |
|---|---|
| 2nd row | Allamakee County |
| 3rd row | St. Lucie County |
| 4th row | Delaware County |
| 5th row | Kimble County |
| Value | Count | Frequency (%) |
| county | 135423 | |
| st | 3893 | 1.3% |
| parish | 3203 | 1.1% |
| monroe | 3117 | 1.0% |
| lucie | 2649 | 0.9% |
| montgomery | 2553 | 0.9% |
| san | 2117 | 0.7% |
| prince | 1875 | 0.6% |
| george's | 1763 | 0.6% |
| jackson | 1748 | 0.6% |
| Other values (2256) | 139876 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 223770 | |
| o | 216846 | |
| t | 181049 | 9.0% |
| u | 160924 | 8.0% |
| (space) | 158244 | 7.9% |
| C | 152414 | 7.6% |
| y | 151819 | 7.6% |
| e | 105735 | 5.3% |
| a | 103265 | 5.1% |
| r | 74023 | 3.7% |
| Other values (55) | 481888 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1547182 | |
| Uppercase Letter | 298415 | 14.8% |
| Space Separator | 158244 | 7.9% |
| Other Punctuation | 5911 | 0.3% |
| Dash Punctuation | 225 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 223770 | |
| o | 216846 | |
| t | 181049 | |
| u | 160924 | |
| y | 151819 | |
| e | 105735 | |
| a | 103265 | |
| r | 74023 | 4.8% |
| i | 55529 | 3.6% |
| l | 50155 | 3.2% |
| Other values (22) | 224067 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 152414 | |
| M | 16357 | 5.5% |
| S | 14112 | 4.7% |
| L | 13053 | 4.4% |
| P | 12734 | 4.3% |
| B | 11994 | 4.0% |
| G | 8960 | 3.0% |
| W | 8635 | 2.9% |
| A | 8280 | 2.8% |
| D | 7831 | 2.6% |
| Other values (16) | 44045 | 14.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3891 | |
| ' | 1979 | |
| , | 24 | 0.4% |
| & | 11 | 0.2% |
| / | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 158244 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1845597 | |
| Common | 164380 | 8.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 223770 | |
| o | 216846 | |
| t | 181049 | |
| u | 160924 | 8.7% |
| C | 152414 | 8.3% |
| y | 151819 | 8.2% |
| e | 105735 | 5.7% |
| a | 103265 | 5.6% |
| r | 74023 | 4.0% |
| i | 55529 | 3.0% |
| Other values (48) | 420223 |
Common
| Value | Count | Frequency (%) |
| (space) | 158244 | |
| . | 3891 | 2.4% |
| ' | 1979 | 1.2% |
| - | 225 | 0.1% |
| , | 24 | < 0.1% |
| & | 11 | < 0.1% |
| / | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2009968 | |
| None | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 223770 | |
| o | 216846 | |
| t | 181049 | 9.0% |
| u | 160924 | 8.0% |
| (space) | 158244 | 7.9% |
| C | 152414 | 7.6% |
| y | 151819 | 7.6% |
| e | 105735 | 5.3% |
| a | 103265 | 5.1% |
| r | 74023 | 3.7% |
| Other values (49) | 481879 |
None
| Value | Count | Frequency (%) |
| ó | 3 | |
| ü | 2 | |
| ñ | 1 | 11.1% |
| ç | 1 | 11.1% |
| ø | 1 | 11.1% |
| è | 1 | 11.1% |
locality
Text
Missing 
| Distinct | 204742 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 642386 |
| Missing (%) | 33.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 21793 |
|---|---|
| Median length | 378 |
| Mean length | 29.00482474 |
| Min length | 1 |
Unique
| Unique | 126316 |
|---|---|
| Unique (%) | 9.8% |
Sample
| 1st row | off Delaware |
|---|---|
| 2nd row | W Coast |
| 3rd row | Cape Sable, West Of |
| 4th row | Antarctic Peninsula |
| 5th row | Georges Bank |
| Value | Count | Frequency (%) |
| island | 342357 | 5.6% |
| of | 336472 | 5.5% |
| off | 252665 | 4.1% |
| bay | 137534 | 2.2% |
| islands | 98147 | 1.6% |
| bank | 84597 | 1.4% |
| south | 74630 | 1.2% |
| georges | 66663 | 1.1% |
| florida | 63432 | 1.0% |
| river | 63370 | 1.0% |
| Other values (77326) | 4636608 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4869900 | 13.1% |
| a | 3498938 | 9.4% |
| e | 2451391 | 6.6% |
| o | 2297059 | 6.2% |
| n | 2155175 | 5.8% |
| r | 1674733 | 4.5% |
| s | 1629255 | 4.4% |
| i | 1598121 | 4.3% |
| l | 1584743 | 4.3% |
| t | 1476204 | 4.0% |
| Other values (129) | 14006879 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25267159 | |
| Uppercase Letter | 5379884 | 14.4% |
| Space Separator | 4869900 | 13.1% |
| Other Punctuation | 1210117 | 3.2% |
| Decimal Number | 428539 | 1.2% |
| Dash Punctuation | 41585 | 0.1% |
| Open Punctuation | 15189 | < 0.1% |
| Close Punctuation | 15060 | < 0.1% |
| Control | 8574 | < 0.1% |
| Math Symbol | 5030 | < 0.1% |
| Other values (7) | 1361 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3498938 | |
| e | 2451391 | |
| o | 2297059 | 9.1% |
| n | 2155175 | 8.5% |
| r | 1674733 | 6.6% |
| s | 1629255 | 6.4% |
| i | 1598121 | 6.3% |
| l | 1584743 | 6.3% |
| t | 1476204 | 5.8% |
| d | 1018124 | 4.0% |
| Other values (49) | 5883416 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 540440 | 10.0% |
| I | 502201 | 9.3% |
| B | 476049 | 8.8% |
| C | 467229 | 8.7% |
| O | 360411 | 6.7% |
| P | 312954 | 5.8% |
| M | 279912 | 5.2% |
| R | 263111 | 4.9% |
| L | 254938 | 4.7% |
| A | 251406 | 4.7% |
| Other values (19) | 1671233 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 987016 | |
| . | 147443 | 12.2% |
| ' | 31555 | 2.6% |
| ; | 24458 | 2.0% |
| / | 8141 | 0.7% |
| # | 2753 | 0.2% |
| : | 2526 | 0.2% |
| & | 2525 | 0.2% |
| " | 2473 | 0.2% |
| ? | 1188 | 0.1% |
| Other values (6) | 39 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 83368 | |
| 0 | 71352 | |
| 2 | 58104 | |
| 5 | 50075 | |
| 3 | 40076 | |
| 4 | 32432 | 7.6% |
| 6 | 30840 | 7.2% |
| 7 | 22387 | 5.2% |
| 8 | 20826 | 4.9% |
| 9 | 19079 | 4.5% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4129 | |
| > | 403 | 8.0% |
| = | 370 | 7.4% |
| ~ | 121 | 2.4% |
| < | 3 | 0.1% |
| \| | 2 | < 0.1% |
| ± | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 14382 | |
| [ | 789 | 5.2% |
| { | 18 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 14274 | |
| ] | 776 | 5.2% |
| } | 10 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41584 | |
| – | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 8535 | |
| (control character) | 39 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4869900 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 762 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 587 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 6 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30647043 | |
| Common | 6595355 | 17.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3498938 | 11.4% |
| e | 2451391 | 8.0% |
| o | 2297059 | 7.5% |
| n | 2155175 | 7.0% |
| r | 1674733 | 5.5% |
| s | 1629255 | 5.3% |
| i | 1598121 | 5.2% |
| l | 1584743 | 5.2% |
| t | 1476204 | 4.8% |
| d | 1018124 | 3.3% |
| Other values (78) | 11263300 |
Common
| Value | Count | Frequency (%) |
| (space) | 4869900 | |
| , | 987016 | 15.0% |
| . | 147443 | 2.2% |
| 1 | 83368 | 1.3% |
| 0 | 71352 | 1.1% |
| 2 | 58104 | 0.9% |
| 5 | 50075 | 0.8% |
| - | 41584 | 0.6% |
| 3 | 40076 | 0.6% |
| 4 | 32432 | 0.5% |
| Other values (41) | 214005 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37240432 | |
| None | 1960 | < 0.1% |
| Modifier Letters | 3 | < 0.1% |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 4869900 | 13.1% |
| a | 3498938 | 9.4% |
| e | 2451391 | 6.6% |
| o | 2297059 | 6.2% |
| n | 2155175 | 5.8% |
| r | 1674733 | 4.5% |
| s | 1629255 | 4.4% |
| i | 1598121 | 4.3% |
| l | 1584743 | 4.3% |
| t | 1476204 | 4.0% |
| Other values (86) | 14004913 |
None
| Value | Count | Frequency (%) |
| ° | 762 | |
| é | 230 | 11.7% |
| ã | 187 | 9.5% |
| á | 141 | 7.2% |
| ó | 138 | 7.0% |
| í | 109 | 5.6% |
| ñ | 78 | 4.0% |
| ú | 55 | 2.8% |
| ç | 36 | 1.8% |
| ī | 36 | 1.8% |
| Other values (29) | 188 | 9.6% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 3 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 | |
| “ | 1 | |
| – | 1 |
Missing 
| Distinct | 126 |
|---|---|
| Distinct (%) | 27.3% |
| Missing | 1925931 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 4 |
| Mean length | 10.17099567 |
| Min length | 4 |
Unique
| Unique | 65 |
|---|---|
| Unique (%) | 14.1% |
Sample
| 1st row | 7000 |
|---|---|
| 2nd row | 4070 m.a.s.l. |
| 3rd row | 4200-4400 |
| 4th row | 2009 +/- 20.1 feet |
| 5th row | 3000 |
| Value | Count | Frequency (%) |
| collected | 53 | 5.6% |
| on | 53 | 5.6% |
| and | 51 | 5.4% |
| flat | 50 | 5.3% |
| lagoon | 50 | 5.3% |
| slope | 50 | 5.3% |
| m | 27 | 2.8% |
| 3800 | 23 | 2.4% |
| 2550 | 21 | 2.2% |
| above | 19 | 2.0% |
| Other values (148) | 554 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 660 | |
| (space) | 489 | 10.4% |
| l | 346 | 7.4% |
| e | 330 | 7.0% |
| o | 320 | 6.8% |
| a | 237 | 5.0% |
| 3 | 219 | 4.7% |
| 5 | 218 | 4.6% |
| t | 202 | 4.3% |
| n | 193 | 4.1% |
| Other values (41) | 1485 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2418 | |
| Decimal Number | 1576 | |
| Space Separator | 489 | 10.4% |
| Other Punctuation | 72 | 1.5% |
| Uppercase Letter | 70 | 1.5% |
| Dash Punctuation | 34 | 0.7% |
| Open Punctuation | 17 | 0.4% |
| Close Punctuation | 17 | 0.4% |
| Math Symbol | 6 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 346 | |
| e | 330 | |
| o | 320 | |
| a | 237 | |
| t | 202 | |
| n | 193 | |
| s | 124 | 5.1% |
| d | 118 | 4.9% |
| f | 84 | 3.5% |
| m | 80 | 3.3% |
| Other values (13) | 384 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 660 | |
| 3 | 219 | 13.9% |
| 5 | 218 | 13.8% |
| 2 | 127 | 8.1% |
| 4 | 98 | 6.2% |
| 8 | 74 | 4.7% |
| 1 | 68 | 4.3% |
| 7 | 48 | 3.0% |
| 9 | 43 | 2.7% |
| 6 | 21 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 40 | |
| ' | 20 | |
| ? | 6 | 8.3% |
| , | 3 | 4.2% |
| / | 2 | 2.8% |
| ; | 1 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 51 | |
| E | 15 | 21.4% |
| A | 2 | 2.9% |
| I | 1 | 1.4% |
| T | 1 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 3 | |
| + | 2 | |
| > | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 489 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 17 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2488 | |
| Common | 2211 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 346 | |
| e | 330 | |
| o | 320 | |
| a | 237 | |
| t | 202 | |
| n | 193 | |
| s | 124 | 5.0% |
| d | 118 | 4.7% |
| f | 84 | 3.4% |
| m | 80 | 3.2% |
| Other values (18) | 454 |
Common
| Value | Count | Frequency (%) |
| 0 | 660 | |
| (space) | 489 | |
| 3 | 219 | 9.9% |
| 5 | 218 | 9.9% |
| 2 | 127 | 5.7% |
| 4 | 98 | 4.4% |
| 8 | 74 | 3.3% |
| 1 | 68 | 3.1% |
| 7 | 48 | 2.2% |
| 9 | 43 | 1.9% |
| Other values (13) | 167 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4699 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 660 | |
| (space) | 489 | 10.4% |
| l | 346 | 7.4% |
| e | 330 | 7.0% |
| o | 320 | 6.8% |
| a | 237 | 5.0% |
| 3 | 219 | 4.7% |
| 5 | 218 | 4.6% |
| t | 202 | 4.3% |
| n | 193 | 4.1% |
| Other values (41) | 1485 |
verbatimDepth
Text
Missing 
| Distinct | 1530 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 1900149 |
| Missing (%) | 98.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 99 |
|---|---|
| Median length | 91 |
| Mean length | 13.43716659 |
| Min length | 1 |
Unique
| Unique | 721 |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | Surface |
|---|---|
| 2nd row | max depth 1772 ft |
| 3rd row | surface |
| 4th row | Intertidal |
| 5th row | Intertidal |
| Value | Count | Frequency (%) |
| intertidal | 11932 | |
| surface | 4085 | 8.0% |
| recorded | 2871 | 5.6% |
| depths | 2850 | 5.6% |
| multiple | 2846 | 5.6% |
| shore | 1165 | 2.3% |
| 0-300 | 1120 | 2.2% |
| 0 | 1069 | 2.1% |
| depth | 1023 | 2.0% |
| low | 964 | 1.9% |
| Other values (1043) | 21003 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 36687 | 10.4% |
| e | 35142 | 10.0% |
| r | 25391 | 7.2% |
| (space) | 24684 | 7.0% |
| d | 24177 | 6.9% |
| l | 20651 | 5.9% |
| a | 20481 | 5.8% |
| i | 19392 | 5.5% |
| 0 | 16029 | 4.5% |
| n | 14727 | 4.2% |
| Other values (69) | 115284 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 250901 | |
| Decimal Number | 39304 | 11.1% |
| Space Separator | 24684 | 7.0% |
| Uppercase Letter | 19960 | 5.7% |
| Other Punctuation | 12440 | 3.5% |
| Dash Punctuation | 4883 | 1.4% |
| Math Symbol | 236 | 0.1% |
| Open Punctuation | 118 | < 0.1% |
| Close Punctuation | 118 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 36687 | |
| e | 35142 | |
| r | 25391 | |
| d | 24177 | |
| l | 20651 | |
| a | 20481 | |
| i | 19392 | |
| n | 14727 | 5.9% |
| c | 8176 | 3.3% |
| p | 7645 | 3.0% |
| Other values (15) | 38432 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 10834 | |
| S | 4489 | |
| M | 2986 | 15.0% |
| L | 758 | 3.8% |
| T | 218 | 1.1% |
| B | 109 | 0.5% |
| H | 83 | 0.4% |
| D | 78 | 0.4% |
| C | 73 | 0.4% |
| Z | 59 | 0.3% |
| Other values (14) | 273 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5999 | |
| : | 3688 | |
| . | 1398 | 11.2% |
| " | 841 | 6.8% |
| ; | 207 | 1.7% |
| ' | 201 | 1.6% |
| @ | 43 | 0.3% |
| / | 29 | 0.2% |
| & | 22 | 0.2% |
| ? | 10 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16029 | |
| 1 | 4890 | 12.4% |
| 2 | 3729 | 9.5% |
| 3 | 3379 | 8.6% |
| 5 | 2940 | 7.5% |
| 8 | 2556 | 6.5% |
| 4 | 1747 | 4.4% |
| 6 | 1723 | 4.4% |
| 7 | 1433 | 3.6% |
| 9 | 878 | 2.2% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 138 | |
| = | 60 | |
| + | 24 | 10.2% |
| ~ | 14 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 24684 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4883 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 118 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 118 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 270861 | |
| Common | 81784 | 23.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 36687 | |
| e | 35142 | |
| r | 25391 | |
| d | 24177 | |
| l | 20651 | 7.6% |
| a | 20481 | 7.6% |
| i | 19392 | 7.2% |
| n | 14727 | 5.4% |
| I | 10834 | 4.0% |
| c | 8176 | 3.0% |
| Other values (39) | 55203 |
Common
| Value | Count | Frequency (%) |
| (space) | 24684 | |
| 0 | 16029 | |
| , | 5999 | 7.3% |
| 1 | 4890 | 6.0% |
| - | 4883 | 6.0% |
| 2 | 3729 | 4.6% |
| : | 3688 | 4.5% |
| 3 | 3379 | 4.1% |
| 5 | 2940 | 3.6% |
| 8 | 2556 | 3.1% |
| Other values (20) | 9007 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 352644 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 36687 | 10.4% |
| e | 35142 | 10.0% |
| r | 25391 | 7.2% |
| (space) | 24684 | 7.0% |
| d | 24177 | 6.9% |
| l | 20651 | 5.9% |
| a | 20481 | 5.8% |
| i | 19392 | 5.5% |
| 0 | 16029 | 4.5% |
| n | 14727 | 4.2% |
| Other values (68) | 115283 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
decimalLatitude
Text
Missing 
| Distinct | 70087 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 927346 |
| Missing (%) | 48.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.236377268 |
| Min length | 3 |
Unique
| Unique | 26230 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | 38.7117 |
|---|---|
| 2nd row | 25.2819 |
| 3rd row | -62.667 |
| 4th row | 42.0833 |
| 5th row | 13.7792 |
| Value | Count | Frequency (%) |
| 25.58 | 10489 | 1.0% |
| 40.6583 | 8821 | 0.9% |
| 26.17 | 7320 | 0.7% |
| 26.5 | 5196 | 0.5% |
| 26.97 | 3956 | 0.4% |
| 25.7883 | 3457 | 0.3% |
| 9.4 | 3109 | 0.3% |
| 9.37 | 2978 | 0.3% |
| 40.895 | 2590 | 0.3% |
| 40.66 | 2520 | 0.3% |
| Other values (65558) | 948611 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 999047 | |
| 3 | 788192 | |
| 2 | 616242 | |
| 5 | 525610 | |
| 7 | 524945 | |
| 4 | 501558 | |
| 1 | 480743 | |
| 6 | 474865 | |
| 8 | 472201 | |
| 9 | 377091 | 6.1% |
| Other values (3) | 469940 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5078800 | |
| Other Punctuation | 999047 | 16.0% |
| Dash Punctuation | 152586 | 2.4% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 788192 | |
| 2 | 616242 | |
| 5 | 525610 | |
| 7 | 524945 | |
| 4 | 501558 | |
| 1 | 480743 | |
| 6 | 474865 | |
| 8 | 472201 | |
| 9 | 377091 | |
| 0 | 317353 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 999047 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 152586 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6230433 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 999047 | |
| 3 | 788192 | |
| 2 | 616242 | |
| 5 | 525610 | |
| 7 | 524945 | |
| 4 | 501558 | |
| 1 | 480743 | |
| 6 | 474865 | |
| 8 | 472201 | |
| 9 | 377091 | 6.1% |
| Other values (2) | 469939 |
Latin
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6230434 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 999047 | |
| 3 | 788192 | |
| 2 | 616242 | |
| 5 | 525610 | |
| 7 | 524945 | |
| 4 | 501558 | |
| 1 | 480743 | |
| 6 | 474865 | |
| 8 | 472201 | |
| 9 | 377091 | 6.1% |
| Other values (3) | 469940 |
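The single Uppercase Letter `E` counted above means at least one decimalLatitude value is not a plain signed decimal. A minimal sketch for flagging such values before casting the column to a numeric type follows. It assumes the column is loaded as strings. The function name, the regular expression and the last two sample inputs are hypothetical, not taken from the dataset.

```python
import re

# Accept only an optional minus sign, digits, and an optional fractional part.
PLAIN_DECIMAL = re.compile(r"-?\d+(\.\d+)?")


def flag_latitudes(values):
    """Split raw strings into valid latitudes (floats in [-90, 90]) and rejects."""
    valid, rejected = [], []
    for raw in values:
        text = raw.strip()
        if PLAIN_DECIMAL.fullmatch(text) and -90.0 <= float(text) <= 90.0:
            valid.append(float(text))
        else:
            rejected.append(raw)
    return valid, rejected


# The first two inputs come from the sample rows; the others are invented
# to show an exponent-style value and an out-of-range value being rejected.
valid, rejected = flag_latitudes(["38.7117", "-62.667", "1.5E1", "123.4"])
print(valid)
print(rejected)
```

Rejected values are returned unchanged so they can be inspected by hand rather than silently coerced.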
decimalLongitude
Text
Missing 
| Distinct | 74625 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 927346 |
| Missing (%) | 48.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 7.110920707 |
| Min length | 3 |
Unique
| Unique | 27280 |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | -73.405 |
|---|---|
| 2nd row | -83.6297 |
| 3rd row | -54.742 |
| 4th row | -66.7708 |
| 5th row | 121.586 |
| Value | Count | Frequency (%) |
| 80.1 | 10529 | 1.1% |
| 127.848 | 4532 | 0.5% |
| 67.7683 | 4213 | 0.4% |
| 80.13 | 3738 | 0.4% |
| 82.7 | 3518 | 0.4% |
| 67.77 | 2821 | 0.3% |
| 66.775 | 2592 | 0.3% |
| 81.6633 | 2462 | 0.2% |
| 70.6731 | 2397 | 0.2% |
| 67.755 | 2356 | 0.2% |
| Other values (69839) | 959889 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 999047 | |
| - | 826266 | |
| 7 | 744654 | |
| 8 | 682740 | |
| 1 | 674897 | |
| 6 | 575520 | |
| 3 | 562337 | |
| 2 | 472623 | |
| 5 | 432907 | |
| 9 | 409886 | |
| Other values (2) | 723267 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5278831 | |
| Other Punctuation | 999047 | 14.1% |
| Dash Punctuation | 826266 | 11.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 744654 | |
| 8 | 682740 | |
| 1 | 674897 | |
| 6 | 575520 | |
| 3 | 562337 | |
| 2 | 472623 | |
| 5 | 432907 | |
| 9 | 409886 | |
| 0 | 371211 | |
| 4 | 352056 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 999047 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 826266 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7104144 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 999047 | |
| - | 826266 | |
| 7 | 744654 | |
| 8 | 682740 | |
| 1 | 674897 | |
| 6 | 575520 | |
| 3 | 562337 | |
| 2 | 472623 | |
| 5 | 432907 | |
| 9 | 409886 | |
| Other values (2) | 723267 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7104144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 999047 | |
| - | 826266 | |
| 7 | 744654 | |
| 8 | 682740 | |
| 1 | 674897 | |
| 6 | 575520 | |
| 3 | 562337 | |
| 2 | 472623 | |
| 5 | 432907 | |
| 9 | 409886 | |
| Other values (2) | 723267 |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1246885 |
| Missing (%) | 64.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.60567057 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 670900 | |
| minutes | 648195 | |
| seconds | 648195 | |
| decimal | 22705 | 1.1% |
| township | 7004 | 0.3% |
| range | 7004 | 0.3% |
| marsden | 605 | < 0.1% |
| square | 605 | < 0.1% |
| unknown | 532 | < 0.1% |
| utm | 464 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3340010 | |
| s | 1974899 | |
| (space) | 1326707 | 8.6% |
| n | 1312599 | 8.5% |
| g | 677904 | 4.4% |
| i | 677904 | 4.4% |
| r | 672113 | 4.4% |
| d | 671463 | 4.4% |
| D | 670945 | 4.4% |
| c | 670901 | 4.4% |
| Other values (20) | 3365289 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12049540 | |
| Uppercase Letter | 1984487 | 12.9% |
| Space Separator | 1326707 | 8.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3340010 | |
| s | 1974899 | |
| n | 1312599 | 10.9% |
| g | 677904 | 5.6% |
| i | 677904 | 5.6% |
| r | 672113 | 5.6% |
| d | 671463 | 5.6% |
| c | 670901 | 5.6% |
| o | 655733 | 5.4% |
| u | 648803 | 5.4% |
| Other values (9) | 747211 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 670945 | |
| M | 649264 | |
| S | 648800 | |
| T | 7468 | 0.4% |
| R | 7004 | 0.4% |
| U | 998 | 0.1% |
| Q | 3 | < 0.1% |
| A | 2 | < 0.1% |
| F | 2 | < 0.1% |
| G | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1326707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14034027 | |
| Common | 1326707 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3340010 | |
| s | 1974899 | |
| n | 1312599 | 9.4% |
| g | 677904 | 4.8% |
| i | 677904 | 4.8% |
| r | 672113 | 4.8% |
| d | 671463 | 4.8% |
| D | 670945 | 4.8% |
| c | 670901 | 4.8% |
| o | 655733 | 4.7% |
| Other values (19) | 2709556 |
Common
| Value | Count | Frequency (%) |
| (space) | 1326707 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15360734 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3340010 | |
| s | 1974899 | |
| (space) | 1326707 | 8.6% |
| n | 1312599 | 8.5% |
| g | 677904 | 4.4% |
| i | 677904 | 4.4% |
| r | 672113 | 4.4% |
| d | 671463 | 4.4% |
| D | 670945 | 4.4% |
| c | 670901 | 4.4% |
| Other values (20) | 3365289 |
verbatimSRS
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1936-08-14 |
|---|---|
| 2nd row | 1926-08-24 |
| Value | Count | Frequency (%) |
| 1936-08-14 | 1 | |
| 1926-08-24 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 6 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 4 | 2 | |
| 2 | 2 | |
| 3 | 1 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16 | |
| Dash Punctuation | 4 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 9 | 2 | |
| 6 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 4 | 2 | |
| 2 | 2 | |
| 3 | 1 | 6.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 6 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 4 | 2 | |
| 2 | 2 | |
| 3 | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 6 | 2 | |
| 0 | 2 | |
| 8 | 2 | |
| 4 | 2 | |
| 2 | 2 | |
| 3 | 1 | 5.0% |
footprintSRS
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 227 |
|---|---|
| 2nd row | 236 |
| Value | Count | Frequency (%) |
| 227 | 1 | |
| 236 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 227 |
|---|---|
| 2nd row | 236 |
| Value | Count | Frequency (%) |
| 227 | 1 | |
| 236 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
georeferencedBy
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1936 |
|---|---|
| 2nd row | 1926 |
| Value | Count | Frequency (%) |
| 1936 | 1 | |
| 1926 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
georeferencedDate
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 8 |
|---|---|
| 2nd row | 8 |
| Value | Count | Frequency (%) |
| 8 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 2 |
Missing 
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1265790 |
| Missing (%) | 65.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 20 |
| Mean length | 20.10026748 |
| Min length | 2 |
Unique
| Unique | 23 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | unknown, from legacy |
|---|---|
| 2nd row | unknown, from legacy |
| 3rd row | unknown, from legacy |
| 4th row | unknown, from legacy |
| 5th row | unknown, from legacy |
| Value | Count | Frequency (%) |
| from | 509060 | |
| unknown | 507577 | |
| legacy | 505126 | |
| geolocate | 70310 | 3.6% |
| names | 41937 | 2.2% |
| geographic | 41556 | 2.1% |
| of | 35279 | 1.8% |
| getty | 34687 | 1.8% |
| thesaurus | 34686 | 1.8% |
| may | 23191 | 1.2% |
| Other values (131) | 141522 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1560807 | 11.8% |
| (space) | 1284328 | 9.7% |
| o | 1253394 | 9.4% |
| e | 822048 | 6.2% |
| a | 797027 | 6.0% |
| r | 642026 | 4.8% |
| c | 624647 | 4.7% |
| g | 591299 | 4.5% |
| u | 580748 | 4.4% |
| y | 577424 | 4.3% |
| Other values (54) | 4544549 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10761243 | |
| Space Separator | 1284328 | 9.7% |
| Uppercase Letter | 560123 | 4.2% |
| Other Punctuation | 551606 | 4.2% |
| Decimal Number | 114476 | 0.9% |
| Dash Punctuation | 3269 | < 0.1% |
| Close Punctuation | 1624 | < 0.1% |
| Open Punctuation | 1624 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1560807 | |
| o | 1253394 | |
| e | 822048 | 7.6% |
| a | 797027 | 7.4% |
| r | 642026 | 6.0% |
| c | 624647 | 5.8% |
| g | 591299 | 5.5% |
| u | 580748 | 5.4% |
| y | 577424 | 5.4% |
| m | 572941 | 5.3% |
| Other values (14) | 2738882 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 185819 | |
| L | 76699 | |
| E | 75168 | |
| O | 56836 | 10.1% |
| N | 43901 | 7.8% |
| T | 36738 | 6.6% |
| M | 26366 | 4.7% |
| S | 23940 | 4.3% |
| U | 8298 | 1.5% |
| I | 8277 | 1.5% |
| Other values (9) | 18081 | 3.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52676 | |
| 2 | 49533 | |
| 9 | 5900 | 5.2% |
| 4 | 2928 | 2.6% |
| 1 | 1976 | 1.7% |
| 5 | 1442 | 1.3% |
| 8 | 15 | < 0.1% |
| 7 | 4 | < 0.1% |
| 3 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 528985 | |
| / | 9412 | 1.7% |
| . | 9111 | 1.7% |
| : | 3483 | 0.6% |
| & | 594 | 0.1% |
| ! | 18 | < 0.1% |
| ' | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1284328 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3269 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1624 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1624 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11321366 | |
| Common | 1956931 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1560807 | |
| o | 1253394 | 11.1% |
| e | 822048 | 7.3% |
| a | 797027 | 7.0% |
| r | 642026 | 5.7% |
| c | 624647 | 5.5% |
| g | 591299 | 5.2% |
| u | 580748 | 5.1% |
| y | 577424 | 5.1% |
| m | 572941 | 5.1% |
| Other values (33) | 3299005 |
Common
| Value | Count | Frequency (%) |
| (space) | 1284328 | |
| , | 528985 | |
| 0 | 52676 | 2.7% |
| 2 | 49533 | 2.5% |
| / | 9412 | 0.5% |
| . | 9111 | 0.5% |
| 9 | 5900 | 0.3% |
| : | 3483 | 0.2% |
| - | 3269 | 0.2% |
| 4 | 2928 | 0.1% |
| Other values (11) | 7306 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13278297 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1560807 | 11.8% |
| (space) | 1284328 | 9.7% |
| o | 1253394 | 9.4% |
| e | 822048 | 6.2% |
| a | 797027 | 6.0% |
| r | 642026 | 4.8% |
| c | 624647 | 4.7% |
| g | 591299 | 4.5% |
| u | 580748 | 4.4% |
| y | 577424 | 4.3% |
| Other values (54) | 4544549 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 1926390 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
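The percentages in the table above appear to use two different denominators: Missing (%) is relative to all observations, while Distinct (%) and Unique (%) are consistent with being relative to the non-missing values only. The following is a minimal sketch of that arithmetic, assuming this interpretation (it reproduces the figures reported here, but it is not taken from the profiler's source code). The helper name `pct` is hypothetical.

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole

# Counts copied from the report (overview and the table above).
n_rows = 1926393     # "Number of observations"
n_missing = 1926390  # "Missing"
n_distinct = 2       # "Distinct"
n_unique = 1         # "Unique" (values occurring exactly once)

# Assumption: distinct/unique percentages are taken over non-missing values.
n_present = n_rows - n_missing

distinct_pct = round(pct(n_distinct, n_present), 1)
unique_pct = round(pct(n_unique, n_present), 1)
missing_pct = pct(n_missing, n_rows)

print(n_present, distinct_pct, unique_pct, missing_pct > 99.9)
```

Under this reading, a column with only a handful of populated cells can show a high Distinct (%) even though almost every cell is empty, so the two percentages should be read together.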
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 9.666666667 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | PARATYPE |
| Value | Count | Frequency (%) |
| paratype | 2 | |
| north_america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6 | |
| P | 4 | |
| R | 4 | |
| T | 3 | |
| E | 3 | |
| Y | 2 | 6.9% |
| N | 1 | 3.4% |
| O | 1 | 3.4% |
| H | 1 | 3.4% |
| _ | 1 | 3.4% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 28 | |
| Connector Punctuation | 1 | 3.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6 | |
| P | 4 | |
| R | 4 | |
| T | 3 | |
| E | 3 | |
| Y | 2 | 7.1% |
| N | 1 | 3.6% |
| O | 1 | 3.6% |
| H | 1 | 3.6% |
| M | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28 | |
| Common | 1 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6 | |
| P | 4 | |
| R | 4 | |
| T | 3 | |
| E | 3 | |
| Y | 2 | 7.1% |
| N | 1 | 3.6% |
| O | 1 | 3.6% |
| H | 1 | 3.6% |
| M | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |
Common
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6 | |
| P | 4 | |
| R | 4 | |
| T | 3 | |
| E | 3 | |
| Y | 2 | 6.9% |
| N | 1 | 3.4% |
| O | 1 | 3.4% |
| H | 1 | 3.4% |
| _ | 1 | 3.4% |
| Other values (3) | 3 |
Missing 
| Distinct | 4822 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 1896105 |
| Missing (%) | 98.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 118 |
| Mean length | 23.03717644 |
| Min length | 1 |
Unique
| Unique | 3165 |
|---|---|
| Unique (%) | 10.4% |
Sample
| 1st row | Extended About 16 Km Offshore From Crystal River Power Plant |
|---|---|
| 2nd row | 0.8 mile west of Montgomery-Polk county line, north side of |
| 3rd row | San Andreas Fault |
| 4th row | 6 Mile W Of Watsonville |
| 5th row | from Holt data card |
| Value | Count | Frequency (%) |
| approximate | 9789 | 8.9% |
| from | 6478 | 5.9% |
| river | 3464 | 3.2% |
| of | 3097 | 2.8% |
| about | 3076 | 2.8% |
| 16 | 2974 | 2.7% |
| km | 2970 | 2.7% |
| plant | 2933 | 2.7% |
| power | 2929 | 2.7% |
| offshore | 2929 | 2.7% |
| Other values (4971) | 68760 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 79111 | 11.3% |
| a | 60517 | 8.7% |
| e | 55652 | 8.0% |
| o | 49194 | 7.1% |
| r | 47507 | 6.8% |
| t | 40249 | 5.8% |
| i | 29470 | 4.2% |
| n | 26681 | 3.8% |
| p | 24672 | 3.5% |
| m | 24234 | 3.5% |
| Other values (68) | 260463 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 519837 | |
| Space Separator | 79111 | 11.3% |
| Uppercase Letter | 71789 | 10.3% |
| Decimal Number | 14985 | 2.1% |
| Other Punctuation | 10495 | 1.5% |
| Close Punctuation | 574 | 0.1% |
| Open Punctuation | 570 | 0.1% |
| Dash Punctuation | 354 | 0.1% |
| Math Symbol | 35 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 60517 | |
| e | 55652 | |
| o | 49194 | 9.5% |
| r | 47507 | 9.1% |
| t | 40249 | 7.7% |
| i | 29470 | 5.7% |
| n | 26681 | 5.1% |
| p | 24672 | 4.7% |
| m | 24234 | 4.7% |
| l | 23891 | 4.6% |
| Other values (16) | 137770 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 8828 | |
| R | 7369 | |
| C | 6873 | 9.6% |
| O | 6477 | 9.0% |
| B | 4747 | 6.6% |
| A | 4364 | 6.1% |
| F | 4160 | 5.8% |
| E | 3949 | 5.5% |
| S | 3870 | 5.4% |
| K | 3632 | 5.1% |
| Other values (16) | 17520 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4141 | |
| 6 | 3403 | |
| 5 | 1661 | |
| 0 | 1443 | 9.6% |
| 3 | 1434 | 9.6% |
| 2 | 951 | 6.3% |
| 4 | 876 | 5.8% |
| 7 | 488 | 3.3% |
| 8 | 412 | 2.7% |
| 9 | 176 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5320 | |
| . | 2062 | 19.6% |
| / | 1922 | 18.3% |
| : | 460 | 4.4% |
| ' | 363 | 3.5% |
| ; | 284 | 2.7% |
| " | 42 | 0.4% |
| & | 23 | 0.2% |
| # | 19 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 564 | |
| ] | 10 | 1.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 560 | |
| [ | 10 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 79111 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 354 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 35 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 591626 | |
| Common | 106124 | 15.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 60517 | 10.2% |
| e | 55652 | 9.4% |
| o | 49194 | 8.3% |
| r | 47507 | 8.0% |
| t | 40249 | 6.8% |
| i | 29470 | 5.0% |
| n | 26681 | 4.5% |
| p | 24672 | 4.2% |
| m | 24234 | 4.1% |
| l | 23891 | 4.0% |
| Other values (42) | 209559 |
Common
| Value | Count | Frequency (%) |
| (space) | 79111 | |
| , | 5320 | 5.0% |
| 1 | 4141 | 3.9% |
| 6 | 3403 | 3.2% |
| . | 2062 | 1.9% |
| / | 1922 | 1.8% |
| 5 | 1661 | 1.6% |
| 0 | 1443 | 1.4% |
| 3 | 1434 | 1.4% |
| 2 | 951 | 0.9% |
| Other values (16) | 4676 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 697750 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 79111 | 11.3% |
| a | 60517 | 8.7% |
| e | 55652 | 8.0% |
| o | 49194 | 7.1% |
| r | 47507 | 6.8% |
| t | 40249 | 5.8% |
| i | 29470 | 4.2% |
| n | 26681 | 3.8% |
| p | 24672 | 3.5% |
| m | 24234 | 3.5% |
| Other values (68) | 260463 |
latestEonOrHighestEonothem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | US |
|---|---|
| Value | Count | Frequency (%) |
| us | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 |
earliestEraOrLowestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Idaho |
|---|---|
| Value | Count | Frequency (%) |
| idaho | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1 | |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 | |
| Uppercase Letter | 1 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1 | |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1 | |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 6482728 |
|---|---|
| 2nd row | 2504455 |
| Value | Count | Frequency (%) |
| 6482728 | 1 | |
| 2504455 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926390 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 75 |
|---|---|
| Median length | 37 |
| Mean length | 46.66666667 |
| Min length | 28 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, North Pacific Ocean, Departure Bay, Canada, British Columbia |
|---|---|
| 2nd row | North America, United States, Georgia |
| 3rd row | North America, United States |
| Value | Count | Frequency (%) |
| north | 4 | |
| america | 3 | |
| united | 2 | |
| states | 2 | |
| pacific | 1 | 5.3% |
| ocean | 1 | 5.3% |
| departure | 1 | 5.3% |
| bay | 1 | 5.3% |
| canada | 1 | 5.3% |
| british | 1 | 5.3% |
| Other values (2) | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 16 | 11.4% |
| a | 14 | 10.0% |
| t | 12 | 8.6% |
| r | 11 | 7.9% |
| e | 11 | 7.9% |
| i | 11 | 7.9% |
| , | 7 | 5.0% |
| o | 6 | 4.3% |
| c | 6 | 4.3% |
| h | 5 | 3.6% |
| Other values (21) | 41 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 98 | |
| Uppercase Letter | 19 | 13.6% |
| Space Separator | 16 | 11.4% |
| Other Punctuation | 7 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14 | |
| t | 12 | |
| r | 11 | |
| e | 11 | |
| i | 11 | |
| o | 6 | |
| c | 6 | |
| h | 5 | 5.1% |
| n | 4 | 4.1% |
| m | 4 | 4.1% |
| Other values (9) | 14 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4 | |
| A | 3 | |
| S | 2 | |
| U | 2 | |
| B | 2 | |
| C | 2 | |
| G | 1 | 5.3% |
| O | 1 | 5.3% |
| D | 1 | 5.3% |
| P | 1 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 16 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 117 | |
| Common | 23 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14 | |
| t | 12 | 10.3% |
| r | 11 | 9.4% |
| e | 11 | 9.4% |
| i | 11 | 9.4% |
| o | 6 | 5.1% |
| c | 6 | 5.1% |
| h | 5 | 4.3% |
| n | 4 | 3.4% |
| N | 4 | 3.4% |
| Other values (19) | 33 |
Common
| Value | Count | Frequency (%) |
| (space) | 16 | |
| , | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 140 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 16 | 11.4% |
| a | 14 | 10.0% |
| t | 12 | 8.6% |
| r | 11 | 7.9% |
| e | 11 | 7.9% |
| i | 11 | 7.9% |
| , | 7 | 5.0% |
| o | 6 | 4.3% |
| c | 6 | 4.3% |
| h | 5 | 3.6% |
| Other values (21) | 41 |
earliestAgeOrLowestStage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 1926390 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 6 | |
| A | 6 | |
| N | 3 | |
| O | 3 | |
| T | 3 | |
| H | 3 | |
| _ | 3 | |
| M | 3 | |
| E | 3 | |
| I | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 36 | |
| Connector Punctuation | 3 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 6 | |
| A | 6 | |
| N | 3 | |
| O | 3 | |
| T | 3 | |
| H | 3 | |
| M | 3 | |
| E | 3 | |
| I | 3 | |
| C | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36 | |
| Common | 3 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 6 | |
| A | 6 | |
| N | 3 | |
| O | 3 | |
| T | 3 | |
| H | 3 | |
| M | 3 | |
| E | 3 | |
| I | 3 | |
| C | 3 |
Common
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 6 | |
| A | 6 | |
| N | 3 | |
| O | 3 | |
| T | 3 | |
| H | 3 | |
| _ | 3 | |
| M | 3 | |
| E | 3 | |
| I | 3 |
latestAgeOrHighestStage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 34 |
| Mean length | 34 |
| Min length | 34 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North Pacific Ocean, Departure Bay |
|---|---|
| Value | Count | Frequency (%) |
| north | 1 | |
| pacific | 1 | |
| ocean | 1 | |
| departure | 1 | |
| bay | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4 | 11.8% |
| a | 4 | 11.8% |
| r | 3 | 8.8% |
| c | 3 | 8.8% |
| e | 3 | 8.8% |
| t | 2 | 5.9% |
| i | 2 | 5.9% |
| N | 1 | 2.9% |
| , | 1 | 2.9% |
| B | 1 | 2.9% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 5 | 14.7% |
| Space Separator | 4 | 11.8% |
| Other Punctuation | 1 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | |
| c | 3 | |
| e | 3 | |
| t | 2 | |
| i | 2 | |
| u | 1 | 4.2% |
| p | 1 | 4.2% |
| f | 1 | 4.2% |
| n | 1 | 4.2% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| B | 1 | |
| D | 1 | |
| O | 1 | |
| P | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29 | |
| Common | 5 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | 10.3% |
| c | 3 | 10.3% |
| e | 3 | 10.3% |
| t | 2 | 6.9% |
| i | 2 | 6.9% |
| N | 1 | 3.4% |
| B | 1 | 3.4% |
| u | 1 | 3.4% |
| p | 1 | 3.4% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| (space) | 4 | |
| , | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 4 | 11.8% |
| a | 4 | 11.8% |
| r | 3 | 8.8% |
| c | 3 | 8.8% |
| e | 3 | 8.8% |
| t | 2 | 5.9% |
| i | 2 | 5.9% |
| N | 1 | 2.9% |
| , | 1 | 2.9% |
| B | 1 | 2.9% |
| Other values (10) | 10 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 1926388 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 2 |
| Mean length | 18.8 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 60.0% |
Sample
| 1st row | Hemionchos striatus Campbell & Beveridge, 2006 |
|---|---|
| 2nd row | CA |
| 3rd row | US |
| 4th row | US |
| 5th row | Conspicuum icteridorum Denton & Byrd, 1951 |
| Value | Count | Frequency (%) |
| us | 2 | |
| 2 | ||
| hemionchos | 1 | 6.7% |
| striatus | 1 | 6.7% |
| campbell | 1 | 6.7% |
| beveridge | 1 | 6.7% |
| 2006 | 1 | 6.7% |
| ca | 1 | 6.7% |
| conspicuum | 1 | 6.7% |
| icteridorum | 1 | 6.7% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 10 | 10.6% |
| e | 7 | 7.4% |
| i | 6 | 6.4% |
| o | 5 | 5.3% |
| r | 5 | 5.3% |
| u | 4 | 4.3% |
| m | 4 | 4.3% |
| n | 4 | 4.3% |
| s | 4 | 4.3% |
| t | 4 | 4.3% |
| Other values (25) | 41 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 60 | |
| Uppercase Letter | 12 | 12.8% |
| Space Separator | 10 | 10.6% |
| Decimal Number | 8 | 8.5% |
| Other Punctuation | 4 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7 | |
| i | 6 | |
| o | 5 | 8.3% |
| r | 5 | 8.3% |
| u | 4 | 6.7% |
| m | 4 | 6.7% |
| n | 4 | 6.7% |
| s | 4 | 6.7% |
| t | 4 | 6.7% |
| d | 3 | 5.0% |
| Other values (9) | 14 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3 | |
| U | 2 | |
| B | 2 | |
| S | 2 | |
| A | 1 | 8.3% |
| D | 1 | 8.3% |
| H | 1 | 8.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 2 | |
| 2 | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 | |
| & | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72 | |
| Common | 22 | 23.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7 | 9.7% |
| i | 6 | 8.3% |
| o | 5 | 6.9% |
| r | 5 | 6.9% |
| u | 4 | 5.6% |
| m | 4 | 5.6% |
| n | 4 | 5.6% |
| s | 4 | 5.6% |
| t | 4 | 5.6% |
| d | 3 | 4.2% |
| Other values (16) | 26 |
Common
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 0 | 2 | 9.1% |
| , | 2 | 9.1% |
| & | 2 | 9.1% |
| 1 | 2 | 9.1% |
| 2 | 1 | 4.5% |
| 6 | 1 | 4.5% |
| 9 | 1 | 4.5% |
| 5 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 10 | 10.6% |
| e | 7 | 7.4% |
| i | 6 | 6.4% |
| o | 5 | 5.3% |
| r | 5 | 5.3% |
| u | 4 | 4.3% |
| m | 4 | 4.3% |
| n | 4 | 4.3% |
| s | 4 | 4.3% |
| t | 4 | 4.3% |
| Other values (25) | 41 |
group
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11.5 |
| Mean length | 11.5 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | British Columbia |
|---|---|
| 2nd row | Georgia |
| Value | Count | Frequency (%) |
| british | 1 | |
| columbia | 1 | |
| georgia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4 | |
| o | 2 | 8.7% |
| a | 2 | 8.7% |
| r | 2 | 8.7% |
| u | 1 | 4.3% |
| e | 1 | 4.3% |
| G | 1 | 4.3% |
| b | 1 | 4.3% |
| m | 1 | 4.3% |
| B | 1 | 4.3% |
| Other values (7) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19 | |
| Uppercase Letter | 3 | 13.0% |
| Space Separator | 1 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| o | 2 | |
| a | 2 | |
| r | 2 | |
| u | 1 | 5.3% |
| e | 1 | 5.3% |
| b | 1 | 5.3% |
| m | 1 | 5.3% |
| l | 1 | 5.3% |
| h | 1 | 5.3% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| B | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22 | |
| Common | 1 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| o | 2 | 9.1% |
| a | 2 | 9.1% |
| r | 2 | 9.1% |
| u | 1 | 4.5% |
| e | 1 | 4.5% |
| G | 1 | 4.5% |
| b | 1 | 4.5% |
| m | 1 | 4.5% |
| B | 1 | 4.5% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4 | |
| o | 2 | 8.7% |
| a | 2 | 8.7% |
| r | 2 | 8.7% |
| u | 1 | 4.3% |
| e | 1 | 4.3% |
| G | 1 | 4.3% |
| b | 1 | 4.3% |
| m | 1 | 4.3% |
| B | 1 | 4.3% |
| Other values (7) | 7 |
bed
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Moultrie |
|---|---|
| Value | Count | Frequency (%) |
| moultrie | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1908260 |
| Missing (%) | 99.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 76 |
|---|---|
| Median length | 3 |
| Mean length | 3.553796945 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | cf. |
|---|---|
| 2nd row | cf. |
| 3rd row | uncertain |
| 4th row | cf. |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| cf | 15638 | |
| uncertain | 1489 | 8.2% |
| aff | 600 | 3.3% |
| near | 404 | 2.2% |
| animalia | 2 | < 0.1% |
| platyhelminthes | 2 | < 0.1% |
| cestoda | 1 | < 0.1% |
| trematoda | 1 | < 0.1% |
| digenea | 1 | < 0.1% |
| plagiorchiida | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 17130 | |
| f | 16838 | |
| . | 16238 | |
| n | 3387 | 5.3% |
| a | 2506 | 3.9% |
| e | 1903 | 3.0% |
| r | 1896 | 2.9% |
| i | 1502 | 2.3% |
| t | 1495 | 2.3% |
| u | 1487 | 2.3% |
| Other values (16) | 59 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48178 | |
| Other Punctuation | 16245 | 25.2% |
| Uppercase Letter | 11 | < 0.1% |
| Space Separator | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 17130 | |
| f | 16838 | |
| n | 3387 | 7.0% |
| a | 2506 | 5.2% |
| e | 1903 | 3.9% |
| r | 1896 | 3.9% |
| i | 1502 | 3.1% |
| t | 1495 | 3.1% |
| u | 1487 | 3.1% |
| l | 8 | < 0.1% |
| Other values (7) | 26 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3 | |
| A | 2 | |
| U | 2 | |
| D | 2 | |
| C | 1 | 9.1% |
| T | 1 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 16238 | |
| , | 7 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48189 | |
| Common | 16252 | 25.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 17130 | |
| f | 16838 | |
| n | 3387 | 7.0% |
| a | 2506 | 5.2% |
| e | 1903 | 3.9% |
| r | 1896 | 3.9% |
| i | 1502 | 3.1% |
| t | 1495 | 3.1% |
| u | 1487 | 3.1% |
| l | 8 | < 0.1% |
| Other values (13) | 37 | 0.1% |
Common
| Value | Count | Frequency (%) |
| . | 16238 | |
| , | 7 | < 0.1% |
| (space) | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64441 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 17130 | |
| f | 16838 | |
| . | 16238 | |
| n | 3387 | 5.3% |
| a | 2506 | 3.9% |
| e | 1903 | 3.0% |
| r | 1896 | 2.9% |
| i | 1502 | 2.3% |
| t | 1495 | 2.3% |
| u | 1487 | 2.3% |
| Other values (16) | 59 | 0.1% |
typeStatus
Text
Missing 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1841066 |
| Missing (%) | 95.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.724987401 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | HOLOTYPE |
| 3rd row | PARATYPE |
| 4th row | HOLOTYPE |
| 5th row | PARATYPE |
| Value | Count | Frequency (%) |
| paratype | 40578 | |
| holotype | 25358 | |
| syntype | 9555 | 11.2% |
| type | 4807 | 5.6% |
| allotype | 2818 | 3.3% |
| lectotype | 862 | 1.0% |
| paralectotype | 795 | 0.9% |
| neotype | 294 | 0.3% |
| hapantotype | 242 | 0.3% |
| paraneotype | 16 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 126956 | |
| Y | 94880 | |
| E | 87292 | |
| T | 87224 | |
| A | 86082 | |
| O | 55743 | |
| R | 41389 | 6.3% |
| L | 32651 | 5.0% |
| H | 25600 | 3.9% |
| N | 10107 | 1.5% |
| Other values (7) | 11226 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 659136 | |
| Lowercase Letter | 14 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 126956 | |
| Y | 94880 | |
| E | 87292 | |
| T | 87224 | |
| A | 86082 | |
| O | 55743 | |
| R | 41389 | 6.3% |
| L | 32651 | 5.0% |
| H | 25600 | 3.9% |
| N | 10107 | 1.5% |
| Other values (2) | 11212 | 1.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| n | 2 | |
| m | 2 | |
| l | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 659150 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 126956 | |
| Y | 94880 | |
| E | 87292 | |
| T | 87224 | |
| A | 86082 | |
| O | 55743 | |
| R | 41389 | 6.3% |
| L | 32651 | 5.0% |
| H | 25600 | 3.9% |
| N | 10107 | 1.5% |
| Other values (7) | 11226 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 659150 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 126956 | |
| Y | 94880 | |
| E | 87292 | |
| T | 87224 | |
| A | 86082 | |
| O | 55743 | |
| R | 41389 | 6.3% |
| L | 32651 | 5.0% |
| H | 25600 | 3.9% |
| N | 10107 | 1.5% |
| Other values (7) | 11226 | 1.7% |
identifiedBy
Text
Missing 
| Distinct | 13461 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 1085208 |
| Missing (%) | 56.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 226 |
|---|---|
| Median length | 133 |
| Mean length | 38.24106825 |
| Min length | 2 |
Unique
| Unique | 4200 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Opresko, Dennis M., Oak Ridge National Laboratory (UNITED STATES) |
|---|---|
| 2nd row | Nance |
| 3rd row | Mah, Christopher, (IZ), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 4th row | Verrill, Addison E., Peabody Museum, Yale |
| 5th row | Judkins, D. |
| Value | Count | Frequency (%) |
| of | 247193 | 5.3% |
| museum | 200643 | 4.3% |
| national | 197127 | 4.2% |
| institution | 188591 | 4.1% |
| smithsonian | 186061 | 4.0% |
| natural | 185777 | 4.0% |
| history | 185423 | 4.0% |
| united | 130413 | 2.8% |
| states | 129643 | 2.8% |
| 87200 | 1.9% | |
| Other values (9433) | 2904278 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3801164 | 11.8% |
| a | 2080528 | 6.5% |
| i | 2056250 | 6.4% |
| t | 2013216 | 6.3% |
| n | 1896071 | 5.9% |
| o | 1744817 | 5.4% |
| e | 1500120 | 4.7% |
| r | 1384928 | 4.3% |
| s | 1382760 | 4.3% |
| , | 1349377 | 4.2% |
| Other values (84) | 12958582 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19467461 | |
| Uppercase Letter | 5957582 | 18.5% |
| Space Separator | 3801164 | 11.8% |
| Other Punctuation | 2377372 | 7.4% |
| Open Punctuation | 230321 | 0.7% |
| Close Punctuation | 230321 | 0.7% |
| Dash Punctuation | 97651 | 0.3% |
| Decimal Number | 5852 | < 0.1% |
| Math Symbol | 89 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2080528 | |
| i | 2056250 | |
| t | 2013216 | |
| n | 1896071 | |
| o | 1744817 | |
| e | 1500120 | |
| r | 1384928 | |
| s | 1382760 | |
| u | 1079811 | 5.5% |
| l | 969734 | 5.0% |
| Other values (37) | 3359226 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 646353 | 10.8% |
| N | 570409 | 9.6% |
| M | 471253 | 7.9% |
| I | 456265 | 7.7% |
| T | 454120 | 7.6% |
| H | 422947 | 7.1% |
| E | 378825 | 6.4% |
| A | 333756 | 5.6% |
| D | 272623 | 4.6% |
| C | 241437 | 4.1% |
| Other values (18) | 1709594 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1349377 | |
| . | 937315 | |
| ; | 64078 | 2.7% |
| / | 16442 | 0.7% |
| & | 5588 | 0.2% |
| ' | 4526 | 0.2% |
| " | 46 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2732 | |
| 1 | 2732 | |
| 2 | 148 | 2.5% |
| 0 | 92 | 1.6% |
| 6 | 74 | 1.3% |
| 9 | 74 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 97644 | |
| – | 7 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3801164 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 230321 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 230321 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25425043 | |
| Common | 6742770 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2080528 | 8.2% |
| i | 2056250 | 8.1% |
| t | 2013216 | 7.9% |
| n | 1896071 | 7.5% |
| o | 1744817 | 6.9% |
| e | 1500120 | 5.9% |
| r | 1384928 | 5.4% |
| s | 1382760 | 5.4% |
| u | 1079811 | 4.2% |
| l | 969734 | 3.8% |
| Other values (65) | 9316808 |
Common
| Value | Count | Frequency (%) |
| (space) | 3801164 | |
| , | 1349377 | 20.0% |
| . | 937315 | 13.9% |
| ( | 230321 | 3.4% |
| ) | 230321 | 3.4% |
| - | 97644 | 1.4% |
| ; | 64078 | 1.0% |
| / | 16442 | 0.2% |
| & | 5588 | 0.1% |
| ' | 4526 | 0.1% |
| Other values (9) | 5994 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32162269 | |
| None | 5537 | < 0.1% |
| Punctuation | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3801164 | 11.8% |
| a | 2080528 | 6.5% |
| i | 2056250 | 6.4% |
| t | 2013216 | 6.3% |
| n | 1896071 | 5.9% |
| o | 1744817 | 5.4% |
| e | 1500120 | 4.7% |
| r | 1384928 | 4.3% |
| s | 1382760 | 4.3% |
| , | 1349377 | 4.2% |
| Other values (60) | 12953038 |
None
| Value | Count | Frequency (%) |
| é | 1460 | |
| í | 1289 | |
| á | 848 | |
| ñ | 436 | 7.9% |
| ã | 401 | 7.2% |
| è | 285 | 5.1% |
| ö | 217 | 3.9% |
| ç | 159 | 2.9% |
| ó | 99 | 1.8% |
| ø | 98 | 1.8% |
| Other values (13) | 245 | 4.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 7 |
identifiedByID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Cestoda |
|---|---|
| 2nd row | Trematoda |
| Value | Count | Frequency (%) |
| cestoda | 1 | |
| trematoda | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| t | 2 | |
| o | 2 | |
| d | 2 | |
| C | 1 | 6.2% |
| s | 1 | 6.2% |
| T | 1 | 6.2% |
| r | 1 | 6.2% |
| m | 1 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 2 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| t | 2 | |
| o | 2 | |
| d | 2 | |
| s | 1 | 7.1% |
| r | 1 | 7.1% |
| m | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| T | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| t | 2 | |
| o | 2 | |
| d | 2 | |
| C | 1 | 6.2% |
| s | 1 | 6.2% |
| T | 1 | 6.2% |
| r | 1 | 6.2% |
| m | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 2 | |
| t | 2 | |
| o | 2 | |
| d | 2 | |
| C | 1 | 6.2% |
| s | 1 | 6.2% |
| T | 1 | 6.2% |
| r | 1 | 6.2% |
| m | 1 | 6.2% |
dateIdentified
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13.5 |
| Mean length | 13.5 |
| Min length | 13 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Trypanorhyncha |
|---|---|
| 2nd row | Plagiorchiida |
| Value | Count | Frequency (%) |
| trypanorhyncha | 1 | |
| plagiorchiida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | |
| h | 3 | |
| i | 3 | |
| y | 2 | |
| n | 2 | |
| o | 2 | |
| c | 2 | |
| T | 1 | 3.7% |
| p | 1 | 3.7% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 | |
| Uppercase Letter | 2 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | |
| h | 3 | |
| i | 3 | |
| y | 2 | |
| n | 2 | |
| o | 2 | |
| c | 2 | |
| p | 1 | 4.0% |
| l | 1 | 4.0% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | |
| h | 3 | |
| i | 3 | |
| y | 2 | |
| n | 2 | |
| o | 2 | |
| c | 2 | |
| T | 1 | 3.7% |
| p | 1 | 3.7% |
| Other values (4) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | |
| h | 3 | |
| i | 3 | |
| y | 2 | |
| n | 2 | |
| o | 2 | |
| c | 2 | |
| T | 1 | 3.7% |
| p | 1 | 3.7% |
| Other values (4) | 4 |
identificationVerificationStatus
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926390 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 12.66666667 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Eutetrarhynchidae |
|---|---|
| 2nd row | 31.1435 |
| 3rd row | Dicrocoeliidae |
| Value | Count | Frequency (%) |
| eutetrarhynchidae | 1 | |
| 31.1435 | 1 | |
| dicrocoeliidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4 | 10.5% |
| e | 4 | 10.5% |
| r | 3 | 7.9% |
| a | 3 | 7.9% |
| c | 3 | 7.9% |
| t | 2 | 5.3% |
| h | 2 | 5.3% |
| o | 2 | 5.3% |
| d | 2 | 5.3% |
| 3 | 2 | 5.3% |
| Other values (10) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29 | |
| Decimal Number | 6 | 15.8% |
| Uppercase Letter | 2 | 5.3% |
| Other Punctuation | 1 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| e | 4 | |
| r | 3 | |
| a | 3 | |
| c | 3 | |
| t | 2 | |
| h | 2 | |
| o | 2 | |
| d | 2 | |
| u | 1 | 3.4% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 1 | 2 | |
| 4 | 1 | |
| 5 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| E | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31 | |
| Common | 7 | 18.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| e | 4 | |
| r | 3 | |
| a | 3 | |
| c | 3 | |
| t | 2 | 6.5% |
| h | 2 | 6.5% |
| o | 2 | 6.5% |
| d | 2 | 6.5% |
| D | 1 | 3.2% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 1 | 2 | |
| 4 | 1 | |
| 5 | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4 | 10.5% |
| e | 4 | 10.5% |
| r | 3 | 7.9% |
| a | 3 | 7.9% |
| c | 3 | 7.9% |
| t | 2 | 5.3% |
| h | 2 | 5.3% |
| o | 2 | 5.3% |
| d | 2 | 5.3% |
| 3 | 2 | 5.3% |
| Other values (10) | 11 |
identificationRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -83.7685 |
|---|---|
| Value | Count | Frequency (%) |
| 83.7685 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 2 | |
| - | 1 | |
| 3 | 1 | |
| . | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Dash Punctuation | 1 | 12.5% |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 3 | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 2 | |
| - | 1 | |
| 3 | 1 | |
| . | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 2 | |
| - | 1 | |
| 3 | 1 | |
| . | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
| Distinct | 94526 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 2069 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.457629796 |
| Min length | 1 |
Unique
| Unique | 27027 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 2237081 |
|---|---|
| 2nd row | 5189992 |
| 3rd row | 2258402 |
| 4th row | 5187825 |
| 5th row | 9722403 |
| Value | Count | Frequency (%) |
| 225 | 23786 | 1.2% |
| 5967481 | 15294 | 0.8% |
| 105 | 11162 | 0.6% |
| 52 | 8679 | 0.5% |
| 7296 | 8105 | 0.4% |
| 637 | 6531 | 0.3% |
| 137 | 6505 | 0.3% |
| 6540 | 4668 | 0.2% |
| 255 | 4580 | 0.2% |
| 256 | 4175 | 0.2% |
| Other values (94516) | 1830839 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 | |
| Other values (12) | 20 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12426552 | |
| Lowercase Letter | 18 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| n | 2 | |
| s | 2 | |
| i | 2 | |
| c | 2 | |
| u | 2 | |
| m | 2 | |
| p | 1 | 5.6% |
| e | 1 | 5.6% |
| h | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| H | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12426552 | |
| Latin | 20 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3 | |
| n | 2 | |
| s | 2 | |
| i | 2 | |
| c | 2 | |
| u | 2 | |
| m | 2 | |
| C | 1 | 5.0% |
| p | 1 | 5.0% |
| H | 1 | 5.0% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12426572 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 | |
| Other values (12) | 20 | < 0.1% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Hemionchos |
|---|---|
| 2nd row | Conspicuum |
| Value | Count | Frequency (%) |
| hemionchos | 1 | |
| conspicuum | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| H | 1 | 5.0% |
| e | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 2 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| e | 1 | 5.6% |
| h | 1 | 5.6% |
| p | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1 | |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| H | 1 | 5.0% |
| e | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| H | 1 | 5.0% |
| e | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9.5 |
| Mean length | 9.5 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | striatus |
|---|---|
| 2nd row | icteridorum |
| Value | Count | Frequency (%) |
| striatus | 1 | |
| icteridorum | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
scientificName
Text
| Distinct | 113079 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 168 |
|---|---|
| Median length | 102 |
| Mean length | 29.16433821 |
| Min length | 5 |
Unique
| Unique | 38721 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Scypha Gray, 1821 |
|---|---|
| 2nd row | Bulla striata Bruguière, 1792 |
| 3rd row | Stylopathes columnaris (Duchassaing, 1870) |
| 4th row | Ophiothrix suensonii Lütken, 1856 |
| 5th row | Cypraea labrolineata Gaskoin, 1849 |
| Value | Count | Frequency (%) |
| 136410 | 2.0% | |
| linnaeus | 96753 | 1.4% |
| 1758 | 81495 | 1.2% |
| say | 50998 | 0.8% |
| lamarck | 40009 | 0.6% |
| dall | 28184 | 0.4% |
| conus | 24224 | 0.4% |
| gastropoda | 23786 | 0.4% |
| 1791 | 23649 | 0.3% |
| gmelin | 23215 | 0.3% |
| Other values (70965) | 6239236 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4939373 | 8.8% |
| (space) | 4841572 | 8.6% |
| i | 3725884 | 6.6% |
| e | 3410133 | 6.1% |
| r | 2844813 | 5.1% |
| s | 2669041 | 4.8% |
| o | 2472444 | 4.4% |
| l | 2451221 | 4.4% |
| n | 2432205 | 4.3% |
| t | 1939529 | 3.5% |
| Other values (106) | 24455587 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37415604 | |
| Decimal Number | 6238420 | 11.1% |
| Space Separator | 4841572 | 8.6% |
| Uppercase Letter | 4124724 | 7.3% |
| Other Punctuation | 2108041 | 3.8% |
| Close Punctuation | 713780 | 1.3% |
| Open Punctuation | 713780 | 1.3% |
| Dash Punctuation | 25881 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4939373 | |
| i | 3725884 | |
| e | 3410133 | 9.1% |
| r | 2844813 | 7.6% |
| s | 2669041 | 7.1% |
| o | 2472444 | 6.6% |
| l | 2451221 | 6.6% |
| n | 2432205 | 6.5% |
| t | 1939529 | 5.2% |
| u | 1855689 | 5.0% |
| Other values (50) | 8675272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 397556 | 9.6% |
| C | 388252 | 9.4% |
| P | 377283 | 9.1% |
| L | 347759 | 8.4% |
| M | 289125 | 7.0% |
| A | 288263 | 7.0% |
| B | 241357 | 5.9% |
| H | 229456 | 5.6% |
| G | 212079 | 5.1% |
| D | 169189 | 4.1% |
| Other values (27) | 1184405 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1867300 | |
| 8 | 1302965 | |
| 9 | 698227 | 11.2% |
| 7 | 528982 | 8.5% |
| 5 | 364870 | 5.8% |
| 6 | 311333 | 5.0% |
| 2 | 311221 | 5.0% |
| 0 | 297843 | 4.8% |
| 4 | 286353 | 4.6% |
| 3 | 269326 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1573893 | |
| . | 387974 | 18.4% |
| & | 136412 | 6.5% |
| ' | 9761 | 0.5% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4841572 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 713780 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 713780 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25881 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41540328 | |
| Common | 14641474 | 26.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4939373 | 11.9% |
| i | 3725884 | 9.0% |
| e | 3410133 | 8.2% |
| r | 2844813 | 6.8% |
| s | 2669041 | 6.4% |
| o | 2472444 | 6.0% |
| l | 2451221 | 5.9% |
| n | 2432205 | 5.9% |
| t | 1939529 | 4.7% |
| u | 1855689 | 4.5% |
| Other values (87) | 12799996 |
Common
| Value | Count | Frequency (%) |
| (space) | 4841572 | |
| 1 | 1867300 | 12.8% |
| , | 1573893 | 10.7% |
| 8 | 1302965 | 8.9% |
| ) | 713780 | 4.9% |
| ( | 713780 | 4.9% |
| 9 | 698227 | 4.8% |
| 7 | 528982 | 3.6% |
| . | 387974 | 2.6% |
| 5 | 364870 | 2.5% |
| Other values (9) | 1648131 | 11.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56056544 | |
| None | 125258 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4939373 | 8.8% |
| (space) | 4841572 | 8.6% |
| i | 3725884 | 6.6% |
| e | 3410133 | 6.1% |
| r | 2844813 | 5.1% |
| s | 2669041 | 4.8% |
| o | 2472444 | 4.4% |
| l | 2451221 | 4.4% |
| n | 2432205 | 4.3% |
| t | 1939529 | 3.5% |
| Other values (61) | 24330329 |
None
| Value | Count | Frequency (%) |
| ü | 33290 | |
| ö | 25666 | |
| è | 21744 | |
| é | 20812 | |
| ø | 8502 | 6.8% |
| å | 4702 | 3.8% |
| Ö | 4493 | 3.6% |
| á | 1535 | 1.2% |
| ä | 1285 | 1.0% |
| í | 836 | 0.7% |
| Other values (35) | 2393 | 1.9% |
acceptedNameUsage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| Value | Count | Frequency (%) |
| species | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 4 | |
| E | 4 | |
| P | 2 | |
| C | 2 | |
| I | 2 |
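The overview figures for this variable (Distinct 1, Distinct (%) 50.0%, Unique 0, with only two non-missing rows) are consistent with percentages taken over the non-missing values rather than over all 1926393 observations. The sketch below illustrates that reading on a toy column. It is an assumption drawn from the numbers in this report, not the profiler's actual code, and the `summarize` helper and the toy data are hypothetical.

```python
from collections import Counter

def summarize(values):
    """Summarize a column given as a list, where None marks a missing cell.

    Percentages are computed relative to the non-missing count (an
    assumption inferred from the report's figures).
    """
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n = len(present)
    distinct = len(counts)                                # different values seen
    unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
    return {
        "distinct": distinct,
        "distinct_pct": 100.0 * distinct / n if n else 0.0,
        "unique": unique,
        "unique_pct": 100.0 * unique / n if n else 0.0,
        "missing": len(values) - n,
    }

# Toy column: two identical non-missing values and 8 missing cells
# (the real column has 1926391 missing cells).
column = ["SPECIES", "SPECIES"] + [None] * 8
summary = summarize(column)
print(summary)
```

With two identical values, the sketch reports one distinct value (50.0%) and no unique values (0.0%), matching the pattern in the table above.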
parentNameUsage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | GEOLocate |
|---|---|
| Value | Count | Frequency (%) |
| geolocate | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Uppercase Letter | 4 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
namePublishedIn
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 |
| Distinct | 4354 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 469 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 134 |
|---|---|
| Median length | 117 |
| Mean length | 62.96739176 |
| Min length | 7 |
Unique
| Unique | 586 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Porifera, Calcarea |
|---|---|
| 2nd row | Animalia, Mollusca, Gastropoda, Bullidae |
| 3rd row | Animalia, Cnidaria, Anthozoa, Hexacorallia, Antipatharia, Stylopathidae |
| 4th row | Animalia, Echinodermata, Ophiuroidea, Ophiurida, Ophiotrichidae |
| 5th row | Animalia, Mollusca, Gastropoda, Cypraeidae |
| Value | Count | Frequency (%) |
| animalia | 1922044 | 18.1% |
| mollusca | 866407 | 8.1% |
| gastropoda | 612759 | 5.8% |
| arthropoda | 390750 | 3.7% |
| crustacea | 385110 | 3.6% |
| malacostraca | 301975 | 2.8% |
| eumalacostraca | 294895 | 2.8% |
| annelida | 241801 | 2.3% |
| polychaeta | 212969 | 2.0% |
| bivalvia | 207685 | 2.0% |
| Other values (4342) | 5202802 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 19360559 | |
| i | 10629446 | 8.8% |
| (space) | 8713273 | 7.2% |
| , | 8691731 | 7.2% |
| o | 7923783 | 6.5% |
| l | 7526240 | 6.2% |
| e | 6162876 | 5.1% |
| d | 5675251 | 4.7% |
| r | 5612652 | 4.6% |
| c | 5023755 | 4.1% |
| Other values (50) | 35950845 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93247129 | |
| Uppercase Letter | 10617486 | 8.8% |
| Space Separator | 8713273 | 7.2% |
| Other Punctuation | 8691773 | 7.2% |
| Dash Punctuation | 283 | < 0.1% |
| Open Punctuation | 169 | < 0.1% |
| Close Punctuation | 169 | < 0.1% |
| Connector Punctuation | 126 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 19360559 | |
| i | 10629446 | |
| o | 7923783 | |
| l | 7526240 | 8.1% |
| e | 6162876 | 6.6% |
| d | 5675251 | 6.1% |
| r | 5612652 | 6.0% |
| c | 5023755 | 5.4% |
| n | 4723925 | 5.1% |
| t | 4393622 | 4.7% |
| Other values (16) | 16215020 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2993701 | |
| M | 1365751 | |
| C | 1145094 | 10.8% |
| P | 1046030 | 9.9% |
| E | 846110 | 8.0% |
| G | 714699 | 6.7% |
| S | 488625 | 4.6% |
| D | 335052 | 3.2% |
| B | 296981 | 2.8% |
| T | 261608 | 2.5% |
| Other values (15) | 1123835 | 10.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8691731 | |
| . | 28 | < 0.1% |
| ? | 14 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8713273 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 283 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 169 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 169 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 126 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 103864615 | |
| Common | 17405796 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 19360559 | |
| i | 10629446 | 10.2% |
| o | 7923783 | 7.6% |
| l | 7526240 | 7.2% |
| e | 6162876 | 5.9% |
| d | 5675251 | 5.5% |
| r | 5612652 | 5.4% |
| c | 5023755 | 4.8% |
| n | 4723925 | 4.5% |
| t | 4393622 | 4.2% |
| Other values (41) | 26832506 |
Common
| Value | Count | Frequency (%) |
| (space) | 8713273 | |
| , | 8691731 | |
| - | 283 | < 0.1% |
| [ | 169 | < 0.1% |
| ] | 169 | < 0.1% |
| _ | 126 | < 0.1% |
| . | 28 | < 0.1% |
| ? | 14 | < 0.1% |
| + | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 121270411 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 19360559 | |
| i | 10629446 | 8.8% |
| (space) | 8713273 | 7.2% |
| , | 8691731 | 7.2% |
| o | 7923783 | 6.5% |
| l | 7526240 | 6.2% |
| e | 6162876 | 5.1% |
| d | 5675251 | 4.7% |
| r | 5612652 | 4.6% |
| c | 5023755 | 4.1% |
| Other values (50) | 35950845 |
kingdom
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 8 |
| Mean length | 8.007927786 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 1920497 | |
| chromista | 2826 | 0.1% |
| incertae | 2065 | 0.1% |
| sedis | 2065 | 0.1% |
| protozoa | 964 | < 0.1% |
| bacteria | 35 | < 0.1% |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3847985 | |
| a | 3846927 | |
| m | 1923323 | |
| n | 1922562 | |
| A | 1920497 | |
| l | 1920497 | |
| s | 6956 | < 0.1% |
| e | 6232 | < 0.1% |
| r | 5890 | < 0.1% |
| t | 5890 | < 0.1% |
| Other values (21) | 19625 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13499953 | |
| Uppercase Letter | 1924322 | 12.5% |
| Space Separator | 2065 | < 0.1% |
| Decimal Number | 36 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3847985 | |
| a | 3846927 | |
| m | 1923323 | |
| n | 1922562 | |
| l | 1920497 | |
| s | 6956 | 0.1% |
| e | 6232 | < 0.1% |
| r | 5890 | < 0.1% |
| t | 5890 | < 0.1% |
| o | 5718 | < 0.1% |
| Other values (5) | 7973 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 6 | |
| 8 | 4 | |
| 3 | 4 | |
| 5 | 4 | |
| 9 | 4 | |
| 1 | 2 | 5.6% |
| 7 | 2 | 5.6% |
| 0 | 2 | 5.6% |
| 6 | 2 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1920497 | |
| C | 2826 | 0.1% |
| P | 964 | 0.1% |
| B | 35 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2065 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15424275 | |
| Common | 2109 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3847985 | |
| a | 3846927 | |
| m | 1923323 | |
| n | 1922562 | |
| A | 1920497 | |
| l | 1920497 | |
| s | 6956 | < 0.1% |
| e | 6232 | < 0.1% |
| r | 5890 | < 0.1% |
| t | 5890 | < 0.1% |
| Other values (9) | 17516 | 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 2065 | |
| - | 8 | 0.4% |
| 2 | 6 | 0.3% |
| 4 | 6 | 0.3% |
| 8 | 4 | 0.2% |
| 3 | 4 | 0.2% |
| 5 | 4 | 0.2% |
| 9 | 4 | 0.2% |
| 1 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| Other values (2) | 4 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15426384 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3847985 | |
| a | 3846927 | |
| m | 1923323 | |
| n | 1922562 | |
| A | 1920497 | |
| l | 1920497 | |
| s | 6956 | < 0.1% |
| e | 6232 | < 0.1% |
| r | 5890 | < 0.1% |
| t | 5890 | < 0.1% |
| Other values (21) | 19625 | 0.1% |
phylum
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3160 |
| Missing (%) | 0.2% |
| Memory size | 14.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 8 |
| Mean length | 8.850655641 |
| Min length | 2 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Porifera |
|---|---|
| 2nd row | Mollusca |
| 3rd row | Cnidaria |
| 4th row | Echinodermata |
| 5th row | Mollusca |
| Value | Count | Frequency (%) |
| mollusca | 864192 | |
| arthropoda | 392999 | |
| annelida | 241615 | 12.6% |
| cnidaria | 117703 | 6.1% |
| echinodermata | 91212 | 4.7% |
| nematoda | 68758 | 3.6% |
| platyhelminthes | 45840 | 2.4% |
| porifera | 32733 | 1.7% |
| chordata | 19745 | 1.0% |
| sipuncula | 10415 | 0.5% |
| Other values (42) | 38021 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2238532 | |
| l | 2078076 | |
| o | 1907515 | |
| r | 1110893 | 6.5% |
| c | 984617 | 5.8% |
| d | 936895 | 5.5% |
| s | 910659 | 5.3% |
| u | 885746 | 5.2% |
| M | 866329 | 5.1% |
| n | 769092 | 4.5% |
| Other values (30) | 4333519 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15098638 | |
| Uppercase Letter | 1923235 | 11.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2238532 | |
| l | 2078076 | |
| o | 1907515 | |
| r | 1110893 | |
| c | 984617 | 6.5% |
| d | 936895 | 6.2% |
| s | 910659 | 6.0% |
| u | 885746 | 5.9% |
| n | 769092 | 5.1% |
| t | 683305 | 4.5% |
| Other values (11) | 2593308 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 866329 | |
| A | 639230 | |
| C | 140451 | 7.3% |
| E | 91483 | 4.8% |
| P | 79547 | 4.1% |
| N | 75120 | 3.9% |
| S | 10431 | 0.5% |
| B | 9994 | 0.5% |
| K | 6389 | 0.3% |
| H | 2144 | 0.1% |
| Other values (9) | 2117 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17021873 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2238532 | |
| l | 2078076 | |
| o | 1907515 | |
| r | 1110893 | 6.5% |
| c | 984617 | 5.8% |
| d | 936895 | 5.5% |
| s | 910659 | 5.3% |
| u | 885746 | 5.2% |
| M | 866329 | 5.1% |
| n | 769092 | 4.5% |
| Other values (30) | 4333519 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17021873 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2238532 | |
| l | 2078076 | |
| o | 1907515 | |
| r | 1110893 | 6.5% |
| c | 984617 | 5.8% |
| d | 936895 | 5.5% |
| s | 910659 | 5.3% |
| u | 885746 | 5.2% |
| M | 866329 | 5.1% |
| n | 769092 | 4.5% |
| Other values (30) | 4333519 |
class
Text
Missing 
| Distinct | 116 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 66157 |
| Missing (%) | 3.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 10.05340075 |
| Min length | 4 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Calcarea |
|---|---|
| 2nd row | Gastropoda |
| 3rd row | Anthozoa |
| 4th row | Ophiuroidea |
| 5th row | Gastropoda |
| Value | Count | Frequency (%) |
| gastropoda | 610123 | |
| malacostraca | 301912 | |
| polychaeta | 211086 | 11.3% |
| bivalvia | 207854 | 11.2% |
| anthozoa | 93050 | 5.0% |
| copepoda | 46190 | 2.5% |
| chromadorea | 42750 | 2.3% |
| clitellata | 30336 | 1.6% |
| ophiuroidea | 27087 | 1.5% |
| asteroidea | 25635 | 1.4% |
| Other values (106) | 264213 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4042336 | |
| o | 2534615 | |
| t | 1401870 | 7.5% |
| r | 1169735 | 6.3% |
| s | 1022238 | 5.5% |
| d | 956343 | 5.1% |
| c | 944030 | 5.0% |
| l | 924665 | 4.9% |
| p | 848962 | 4.5% |
| i | 703184 | 3.8% |
| Other values (44) | 4153720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16841416 | |
| Uppercase Letter | 1860238 | 9.9% |
| Decimal Number | 34 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4042336 | |
| o | 2534615 | |
| t | 1401870 | 8.3% |
| r | 1169735 | 6.9% |
| s | 1022238 | 6.1% |
| d | 956343 | 5.7% |
| c | 944030 | 5.6% |
| l | 924665 | 5.5% |
| p | 848962 | 5.0% |
| i | 703184 | 4.2% |
| Other values (14) | 2293438 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 617920 | |
| M | 317072 | |
| P | 232221 | 12.5% |
| B | 211394 | 11.4% |
| C | 168535 | 9.1% |
| A | 139771 | 7.5% |
| O | 50239 | 2.7% |
| H | 37453 | 2.0% |
| T | 25601 | 1.4% |
| E | 22993 | 1.2% |
| Other values (9) | 37039 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 10 | |
| 1 | 7 | |
| 0 | 5 | |
| 4 | 3 | 8.8% |
| 3 | 3 | 8.8% |
| 5 | 3 | 8.8% |
| 8 | 2 | 5.9% |
| 9 | 1 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18701654 | |
| Common | 44 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4042336 | |
| o | 2534615 | |
| t | 1401870 | 7.5% |
| r | 1169735 | 6.3% |
| s | 1022238 | 5.5% |
| d | 956343 | 5.1% |
| c | 944030 | 5.0% |
| l | 924665 | 4.9% |
| p | 848962 | 4.5% |
| i | 703184 | 3.8% |
| Other values (33) | 4153676 |
Common
| Value | Count | Frequency (%) |
| 2 | 10 | |
| 1 | 7 | |
| 0 | 5 | |
| - | 4 | 9.1% |
| : | 4 | 9.1% |
| 4 | 3 | 6.8% |
| 3 | 3 | 6.8% |
| 5 | 3 | 6.8% |
| . | 2 | 4.5% |
| 8 | 2 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18701698 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4042336 | |
| o | 2534615 | |
| t | 1401870 | 7.5% |
| r | 1169735 | 6.3% |
| s | 1022238 | 5.5% |
| d | 956343 | 5.1% |
| c | 944030 | 5.0% |
| l | 924665 | 4.9% |
| p | 848962 | 4.5% |
| i | 703184 | 3.8% |
| Other values (44) | 4153720 |
order
Text
Missing 
| Distinct | 414 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 329537 |
| Missing (%) | 17.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 20 |
| Mean length | 11.19175304 |
| Min length | 5 |
Unique
| Unique | 24 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Leucosolenida |
|---|---|
| 2nd row | Cephalaspidea |
| 3rd row | Antipatharia |
| 4th row | Amphilepidida |
| 5th row | Littorinimorpha |
| Value | Count | Frequency (%) |
| decapoda | 196384 | 12.3% |
| neogastropoda | 156428 | 9.8% |
| stylommatophora | 116401 | 7.3% |
| littorinimorpha | 113553 | 7.1% |
| phyllodocida | 69439 | 4.3% |
| scleractinia | 54200 | 3.4% |
| amphipoda | 49533 | 3.1% |
| rhabditida | 35176 | 2.2% |
| venerida | 31275 | 2.0% |
| cardiida | 30439 | 1.9% |
| Other values (404) | 744028 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2716551 | |
| o | 2130021 | |
| i | 1739041 | 9.7% |
| d | 1413506 | 7.9% |
| t | 1052242 | 5.9% |
| p | 961716 | 5.4% |
| r | 907952 | 5.1% |
| e | 872746 | 4.9% |
| c | 825635 | 4.6% |
| l | 796133 | 4.5% |
| Other values (36) | 4456075 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16274762 | |
| Uppercase Letter | 1596856 | 8.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2716551 | |
| o | 2130021 | |
| i | 1739041 | |
| d | 1413506 | |
| t | 1052242 | 6.5% |
| p | 961716 | 5.9% |
| r | 907952 | 5.6% |
| e | 872746 | 5.4% |
| c | 825635 | 5.1% |
| l | 796133 | 4.9% |
| Other values (14) | 2859219 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 253706 | |
| D | 219372 | |
| N | 170936 | |
| C | 159863 | |
| P | 151094 | |
| L | 149326 | |
| A | 130604 | |
| E | 54026 | 3.4% |
| M | 49031 | 3.1% |
| T | 43142 | 2.7% |
| Other values (12) | 215756 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17871618 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2716551 | |
| o | 2130021 | |
| i | 1739041 | 9.7% |
| d | 1413506 | 7.9% |
| t | 1052242 | 5.9% |
| p | 961716 | 5.4% |
| r | 907952 | 5.1% |
| e | 872746 | 4.9% |
| c | 825635 | 4.6% |
| l | 796133 | 4.5% |
| Other values (36) | 4456075 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17871618 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2716551 | |
| o | 2130021 | |
| i | 1739041 | 9.7% |
| d | 1413506 | 7.9% |
| t | 1052242 | 5.9% |
| p | 961716 | 5.4% |
| r | 907952 | 5.1% |
| e | 872746 | 4.9% |
| c | 825635 | 4.6% |
| l | 796133 | 4.5% |
| Other values (36) | 4456075 |
family
Text
Missing 
| Distinct | 3522 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 144488 |
| Missing (%) | 7.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 21 |
| Mean length | 11.20729837 |
| Min length | 6 |
Unique
| Unique | 272 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Syconidae |
|---|---|
| 2nd row | Bullidae |
| 3rd row | Stylopathidae |
| 4th row | Ophiotrichidae |
| 5th row | Cypraeidae |
| Value | Count | Frequency (%) |
| cambaridae | 28956 | 1.6% |
| conidae | 28425 | 1.6% |
| unionidae | 26787 | 1.5% |
| muricidae | 22783 | 1.3% |
| veneridae | 18640 | 1.0% |
| cypraeidae | 16831 | 0.9% |
| cerithiidae | 16777 | 0.9% |
| spionidae | 15856 | 0.9% |
| syllidae | 14115 | 0.8% |
| pectinidae | 12961 | 0.7% |
| Other values (3512) | 1579774 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2971768 | |
| a | 2739603 | |
| e | 2656666 | |
| d | 2019505 | |
| o | 1034346 | 5.2% |
| l | 1016313 | 5.1% |
| r | 1015168 | 5.1% |
| n | 842574 | 4.2% |
| t | 674978 | 3.4% |
| c | 545975 | 2.7% |
| Other values (42) | 4453445 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18188436 | |
| Uppercase Letter | 1781905 | 8.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2971768 | |
| a | 2739603 | |
| e | 2656666 | |
| d | 2019505 | |
| o | 1034346 | 5.7% |
| l | 1016313 | 5.6% |
| r | 1015168 | 5.6% |
| n | 842574 | 4.6% |
| t | 674978 | 3.7% |
| c | 545975 | 3.0% |
| Other values (16) | 2671540 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 300023 | |
| P | 268312 | |
| A | 153050 | |
| S | 152792 | |
| M | 116985 | 6.6% |
| T | 108088 | 6.1% |
| L | 87632 | 4.9% |
| O | 80760 | 4.5% |
| E | 66802 | 3.7% |
| N | 66686 | 3.7% |
| Other values (16) | 380775 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19970341 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2971768 | |
| a | 2739603 | |
| e | 2656666 | |
| d | 2019505 | |
| o | 1034346 | 5.2% |
| l | 1016313 | 5.1% |
| r | 1015168 | 5.1% |
| n | 842574 | 4.2% |
| t | 674978 | 3.4% |
| c | 545975 | 2.7% |
| Other values (42) | 4453445 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19970341 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2971768 | |
| a | 2739603 | |
| e | 2656666 | |
| d | 2019505 | |
| o | 1034346 | 5.2% |
| l | 1016313 | 5.1% |
| r | 1015168 | 5.1% |
| n | 842574 | 4.2% |
| t | 674978 | 3.4% |
| c | 545975 | 2.7% |
| Other values (42) | 4453445 |
subtribe
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 130 |
|---|---|
| Median length | 89 |
| Mean length | 89 |
| Min length | 48 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 1 | |
| occurrence_status_inferred_from_individual_count | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 17 | |
| E | 16 | 9.0% |
| N | 16 | 9.0% |
| I | 15 | 8.4% |
| T | 13 | 7.3% |
| D | 13 | 7.3% |
| R | 13 | 7.3% |
| C | 12 | 6.7% |
| O | 12 | 6.7% |
| U | 10 | 5.6% |
| Other values (11) | 41 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 156 | |
| Connector Punctuation | 17 | 9.6% |
| Other Punctuation | 3 | 1.7% |
| Decimal Number | 2 | 1.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 16 | |
| N | 16 | |
| I | 15 | |
| T | 13 | |
| D | 13 | |
| R | 13 | |
| C | 12 | |
| O | 12 | |
| U | 10 | 6.4% |
| A | 8 | 5.1% |
| Other values (7) | 28 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 17 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 156 | |
| Common | 22 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 16 | |
| N | 16 | |
| I | 15 | |
| T | 13 | |
| D | 13 | |
| R | 13 | |
| C | 12 | |
| O | 12 | |
| U | 10 | 6.4% |
| A | 8 | 5.1% |
| Other values (7) | 28 |
Common
| Value | Count | Frequency (%) |
| _ | 17 | |
| ; | 3 | 13.6% |
| 8 | 1 | 4.5% |
| 4 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 17 | |
| E | 16 | 9.0% |
| N | 16 | 9.0% |
| I | 15 | 8.4% |
| T | 13 | 7.3% |
| D | 13 | 7.3% |
| R | 13 | 7.3% |
| C | 12 | 6.7% |
| O | 12 | 6.7% |
| U | 10 | 5.6% |
| Other values (11) | 41 |
genus
Text
Missing 
| Distinct | 20787 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 358044 |
| Missing (%) | 18.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 9.482777111 |
| Min length | 2 |
Unique
| Unique | 3152 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Sycon |
|---|---|
| 2nd row | Bulla |
| 3rd row | Stylopathes |
| 4th row | Ophiothrix |
| 5th row | Naria |
| Value | Count | Frequency (%) |
| conus | 22884 | 1.5% |
| cerithium | 8956 | 0.6% |
| cambarus | 8948 | 0.6% |
| faxonius | 8189 | 0.5% |
| procambarus | 8096 | 0.5% |
| aricidea | 5223 | 0.3% |
| nerita | 4536 | 0.3% |
| nassarius | 4534 | 0.3% |
| pagurus | 4234 | 0.3% |
| elimia | 4085 | 0.3% |
| Other values (20777) | 1488664 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1794417 | 12.1% |
| i | 1296042 | 8.7% |
| o | 1190619 | 8.0% |
| e | 1030226 | 6.9% |
| r | 967232 | 6.5% |
| l | 958555 | 6.4% |
| s | 949415 | 6.4% |
| n | 726098 | 4.9% |
| t | 714218 | 4.8% |
| u | 705352 | 4.7% |
| Other values (42) | 4540130 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13303955 | |
| Uppercase Letter | 1568349 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1794417 | |
| i | 1296042 | |
| o | 1190619 | |
| e | 1030226 | 7.7% |
| r | 967232 | 7.3% |
| l | 958555 | 7.2% |
| s | 949415 | 7.1% |
| n | 726098 | 5.5% |
| t | 714218 | 5.4% |
| u | 705352 | 5.3% |
| Other values (16) | 2971781 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 224909 | |
| C | 211243 | |
| A | 159037 | |
| S | 120812 | 7.7% |
| M | 103354 | 6.6% |
| L | 96699 | 6.2% |
| E | 88844 | 5.7% |
| T | 88317 | 5.6% |
| O | 65295 | 4.2% |
| H | 63671 | 4.1% |
| Other values (16) | 346168 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14872304 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1794417 | 12.1% |
| i | 1296042 | 8.7% |
| o | 1190619 | 8.0% |
| e | 1030226 | 6.9% |
| r | 967232 | 6.5% |
| l | 958555 | 6.4% |
| s | 949415 | 6.4% |
| n | 726098 | 4.9% |
| t | 714218 | 4.8% |
| u | 705352 | 4.7% |
| Other values (42) | 4540130 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14872304 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1794417 | 12.1% |
| i | 1296042 | 8.7% |
| o | 1190619 | 8.0% |
| e | 1030226 | 6.9% |
| r | 967232 | 6.5% |
| l | 958555 | 6.4% |
| s | 949415 | 6.4% |
| n | 726098 | 4.9% |
| t | 714218 | 4.8% |
| u | 705352 | 4.7% |
| Other values (42) | 4540130 |
genericName
Text
Missing 
| Distinct | 21084 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 358043 |
| Missing (%) | 18.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 9.309154844 |
| Min length | 1 |
Unique
| Unique | 3830 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Scypha |
|---|---|
| 2nd row | Bulla |
| 3rd row | Stylopathes |
| 4th row | Ophiothrix |
| 5th row | Cypraea |
| Value | Count | Frequency (%) |
| conus | 24156 | 1.5% |
| cypraea | 15390 | 1.0% |
| cambarus | 10146 | 0.6% |
| cerithium | 9393 | 0.6% |
| orconectes | 8661 | 0.6% |
| procambarus | 8047 | 0.5% |
| nassarius | 6727 | 0.4% |
| lumbrineris | 4967 | 0.3% |
| terebra | 4662 | 0.3% |
| aricidea | 4572 | 0.3% |
| Other values (21074) | 1471629 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1744079 | 11.9% |
| i | 1263792 | 8.7% |
| o | 1156021 | 7.9% |
| e | 1016938 | 7.0% |
| r | 967987 | 6.6% |
| s | 938349 | 6.4% |
| l | 915577 | 6.3% |
| t | 706525 | 4.8% |
| n | 704068 | 4.8% |
| u | 686498 | 4.7% |
| Other values (44) | 4500179 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13031665 | |
| Uppercase Letter | 1568347 | 10.7% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1744079 | |
| i | 1263792 | |
| o | 1156021 | 8.9% |
| e | 1016938 | 7.8% |
| r | 967987 | 7.4% |
| s | 938349 | 7.2% |
| l | 915577 | 7.0% |
| t | 706525 | 5.4% |
| n | 704068 | 5.4% |
| u | 686498 | 5.3% |
| Other values (17) | 2931831 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 229066 | |
| P | 219840 | |
| A | 154961 | |
| S | 126006 | 8.0% |
| M | 103229 | 6.6% |
| T | 96207 | 6.1% |
| L | 90921 | 5.8% |
| E | 82417 | 5.3% |
| O | 74791 | 4.8% |
| H | 62524 | 4.0% |
| Other values (16) | 328385 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14600012 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1744079 | 11.9% |
| i | 1263792 | 8.7% |
| o | 1156021 | 7.9% |
| e | 1016938 | 7.0% |
| r | 967987 | 6.6% |
| s | 938349 | 6.4% |
| l | 915577 | 6.3% |
| t | 706525 | 4.8% |
| n | 704068 | 4.8% |
| u | 686498 | 4.7% |
| Other values (43) | 4500178 |
Common
| Value | Count | Frequency (%) |
| ? | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14600012 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1744079 | 11.9% |
| i | 1263792 | 8.7% |
| o | 1156021 | 7.9% |
| e | 1016938 | 7.0% |
| r | 967987 | 6.6% |
| s | 938349 | 6.4% |
| l | 915577 | 6.3% |
| t | 706525 | 4.8% |
| n | 704068 | 4.8% |
| u | 686498 | 4.7% |
| Other values (43) | 4500178 |
None
| Value | Count | Frequency (%) |
| ö | 1 |
subgenus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| Value | Count | Frequency (%) |
| false | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
infragenericEpithet
Text
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 6482728 |
|---|---|
| 2nd row | 2504455 |
| Value | Count | Frequency (%) |
| 6482728 | 1 | |
| 2504455 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
specificEpithet
Text
Missing 
| Distinct | 39412 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 626798 |
| Missing (%) | 32.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 8.507768189 |
| Min length | 2 |
Unique
| Unique | 9920 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | striata |
|---|---|
| 2nd row | columnaris |
| 3rd row | suensonii |
| 4th row | labrolineata |
| 5th row | heteractis |
| Value | Count | Frequency (%) |
| gracilis | 6098 | 0.5% |
| fragilis | 3477 | 0.3% |
| affinis | 3341 | 0.3% |
| elegans | 3182 | 0.2% |
| aculeata | 3066 | 0.2% |
| borealis | 2967 | 0.2% |
| americanus | 2637 | 0.2% |
| grandis | 2519 | 0.2% |
| acutus | 2312 | 0.2% |
| tenuis | 2265 | 0.2% |
| Other values (39402) | 1267731 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1553197 | |
| i | 1250540 | |
| s | 956883 | 8.7% |
| e | 779958 | 7.1% |
| r | 771552 | 7.0% |
| t | 706671 | 6.4% |
| u | 704699 | 6.4% |
| n | 690520 | 6.2% |
| l | 660182 | 6.0% |
| c | 552656 | 5.0% |
| Other values (28) | 2429795 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11056627 | |
| Decimal Number | 14 | < 0.1% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1553197 | |
| i | 1250540 | |
| s | 956883 | 8.7% |
| e | 779958 | 7.1% |
| r | 771552 | 7.0% |
| t | 706671 | 6.4% |
| u | 704699 | 6.4% |
| n | 690520 | 6.2% |
| l | 660182 | 6.0% |
| c | 552656 | 5.0% |
| Other values (20) | 2429769 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 3 | |
| 4 | 3 | |
| 8 | 2 | |
| 0 | 1 | 7.1% |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11056627 | |
| Common | 26 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1553197 | |
| i | 1250540 | |
| s | 956883 | 8.7% |
| e | 779958 | 7.1% |
| r | 771552 | 7.0% |
| t | 706671 | 6.4% |
| u | 704699 | 6.4% |
| n | 690520 | 6.2% |
| l | 660182 | 6.0% |
| c | 552656 | 5.0% |
| Other values (20) | 2429769 |
Common
| Value | Count | Frequency (%) |
| - | 12 | |
| 2 | 3 | 11.5% |
| 5 | 3 | 11.5% |
| 4 | 3 | 11.5% |
| 8 | 2 | 7.7% |
| 0 | 1 | 3.8% |
| 6 | 1 | 3.8% |
| 7 | 1 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11056153 | |
| None | 500 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1553197 | |
| i | 1250540 | |
| s | 956883 | 8.7% |
| e | 779958 | 7.1% |
| r | 771552 | 7.0% |
| t | 706671 | 6.4% |
| u | 704699 | 6.4% |
| n | 690520 | 6.2% |
| l | 660182 | 6.0% |
| c | 552656 | 5.0% |
| Other values (24) | 2429295 |
None
| Value | Count | Frequency (%) |
| ü | 308 | |
| ö | 117 | 23.4% |
| ë | 73 | 14.6% |
| ä | 2 | 0.4% |
infraspecificEpithet
Text
Missing
| Distinct | 3653 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 1890289 |
| Missing (%) | 98.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 8.605777753 |
| Min length | 1 |
Unique
| Unique | 1259 |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | connectens |
|---|---|
| 2nd row | laevis |
| 3rd row | schizodontia |
| 4th row | antarctica |
| 5th row | sayi |
| Value | Count | Frequency (%) |
| acutus | 1011 | 2.8% |
| radiata | 616 | 1.7% |
| bartonii | 521 | 1.4% |
| gibbosus | 501 | 1.4% |
| appressa | 443 | 1.2% |
| campanulatum | 379 | 1.0% |
| longimanus | 359 | 1.0% |
| carinata | 350 | 1.0% |
| floridana | 283 | 0.8% |
| trivolvis | 273 | 0.8% |
| Other values (3643) | 31368 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 45988 | |
| i | 33598 | |
| s | 29641 | |
| e | 22986 | 7.4% |
| n | 22086 | 7.1% |
| u | 19813 | 6.4% |
| r | 19186 | 6.2% |
| t | 17670 | 5.7% |
| l | 16838 | 5.4% |
| c | 16647 | 5.4% |
| Other values (19) | 66250 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 310700 | |
| Decimal Number | 2 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 45988 | |
| i | 33598 | |
| s | 29641 | |
| e | 22986 | 7.4% |
| n | 22086 | 7.1% |
| u | 19813 | 6.4% |
| r | 19186 | 6.2% |
| t | 17670 | 5.7% |
| l | 16838 | 5.4% |
| c | 16647 | 5.4% |
| Other values (17) | 66247 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 310700 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 45988 | |
| i | 33598 | |
| s | 29641 | |
| e | 22986 | 7.4% |
| n | 22086 | 7.1% |
| u | 19813 | 6.4% |
| r | 19186 | 6.2% |
| t | 17670 | 5.7% |
| l | 16838 | 5.4% |
| c | 16647 | 5.4% |
| Other values (17) | 66247 |
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 310683 | |
| None | 20 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 45988 | |
| i | 33598 | |
| s | 29641 | |
| e | 22986 | 7.4% |
| n | 22086 | 7.1% |
| u | 19813 | 6.4% |
| r | 19186 | 6.2% |
| t | 17670 | 5.7% |
| l | 16838 | 5.4% |
| c | 16647 | 5.4% |
| Other values (18) | 66230 |
None
| Value | Count | Frequency (%) |
| ö | 20 |
cultivarEpithet
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 108 |
|---|---|
| 2nd row | 108 |
| Value | Count | Frequency (%) |
| 108 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 8 | 2 |
taxonRank
Text
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.539588038 |
| Min length | 3 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GENUS |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 1263491 | |
| genus | 268755 | 14.0% |
| family | 216656 | 11.2% |
| class | 63569 | 3.3% |
| phylum | 48164 | 2.5% |
| subspecies | 32829 | 1.7% |
| order | 26813 | 1.4% |
| kingdom | 2836 | 0.1% |
| variety | 2500 | 0.1% |
| form | 773 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 3021362 | |
| E | 2890709 | |
| I | 1518312 | |
| C | 1359889 | |
| P | 1344484 | |
| U | 349749 | 2.8% |
| L | 328389 | 2.6% |
| A | 282726 | 2.2% |
| N | 271593 | 2.2% |
| G | 271591 | 2.2% |
| Other values (19) | 958993 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12597784 | |
| Decimal Number | 13 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3021362 | |
| E | 2890709 | |
| I | 1518312 | |
| C | 1359889 | |
| P | 1344484 | |
| U | 349749 | 2.8% |
| L | 328389 | 2.6% |
| A | 282726 | 2.2% |
| N | 271593 | 2.2% |
| G | 271591 | 2.2% |
| Other values (11) | 958980 | 7.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 4 | |
| 1 | 2 | |
| 5 | 2 | |
| 9 | 1 | 7.7% |
| 6 | 1 | 7.7% |
| 7 | 1 | 7.7% |
| 8 | 1 | 7.7% |
| 3 | 1 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12597784 | |
| Common | 13 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 3021362 | |
| E | 2890709 | |
| I | 1518312 | |
| C | 1359889 | |
| P | 1344484 | |
| U | 349749 | 2.8% |
| L | 328389 | 2.6% |
| A | 282726 | 2.2% |
| N | 271593 | 2.2% |
| G | 271591 | 2.2% |
| Other values (11) | 958980 | 7.6% |
Common
| Value | Count | Frequency (%) |
| 4 | 4 | |
| 1 | 2 | |
| 5 | 2 | |
| 9 | 1 | 7.7% |
| 6 | 1 | 7.7% |
| 7 | 1 | 7.7% |
| 8 | 1 | 7.7% |
| 3 | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12597797 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 3021362 | |
| E | 2890709 | |
| I | 1518312 | |
| C | 1359889 | |
| P | 1344484 | |
| U | 349749 | 2.8% |
| L | 328389 | 2.6% |
| A | 282726 | 2.2% |
| N | 271593 | 2.2% |
| G | 271591 | 2.2% |
| Other values (19) | 958993 | 7.6% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 891 |
|---|---|
| 2nd row | 434 |
| Value | Count | Frequency (%) |
| 891 | 1 | |
| 434 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 3 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 3 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 3 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 3 | 1 |
vernacularName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 5954 |
|---|---|
| 2nd row | 6426 |
| Value | Count | Frequency (%) |
| 5954 | 1 | |
| 6426 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 | |
| 6 | 2 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 | |
| 6 | 2 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 | |
| 6 | 2 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 4 | 2 | |
| 6 | 2 | |
| 9 | 1 | |
| 2 | 1 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926389 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 11 |
| Min length | 7 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 6482725 |
|---|---|
| 2nd row | Van Cleave, H. J. |
| 3rd row | Schwartz, Ben |
| 4th row | 2504454 |
| Value | Count | Frequency (%) |
| 6482725 | 1 | |
| van | 1 | |
| cleave | 1 | |
| h | 1 | |
| j | 1 | |
| schwartz | 1 | |
| ben | 1 | |
| 2504454 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4 | 9.1% |
| 4 | 4 | 9.1% |
| 2 | 3 | 6.8% |
| 5 | 3 | 6.8% |
| a | 3 | 6.8% |
| e | 3 | 6.8% |
| . | 2 | 4.5% |
| , | 2 | 4.5% |
| n | 2 | 4.5% |
| w | 1 | 2.3% |
| Other values (17) | 17 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Decimal Number | 14 | |
| Uppercase Letter | 6 | 13.6% |
| Space Separator | 4 | 9.1% |
| Other Punctuation | 4 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| n | 2 | |
| w | 1 | 6.2% |
| h | 1 | 6.2% |
| r | 1 | 6.2% |
| t | 1 | 6.2% |
| z | 1 | 6.2% |
| c | 1 | 6.2% |
| v | 1 | 6.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 4 | |
| 2 | 3 | |
| 5 | 3 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| B | 1 | |
| J | 1 | |
| H | 1 | |
| C | 1 | |
| V | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | |
| , | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 22 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| n | 2 | 9.1% |
| w | 1 | 4.5% |
| h | 1 | 4.5% |
| r | 1 | 4.5% |
| S | 1 | 4.5% |
| t | 1 | 4.5% |
| z | 1 | 4.5% |
| B | 1 | 4.5% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| (space) | 4 | |
| 4 | 4 | |
| 2 | 3 | |
| 5 | 3 | |
| . | 2 | |
| , | 2 | |
| 6 | 1 | 4.5% |
| 7 | 1 | 4.5% |
| 8 | 1 | 4.5% |
| 0 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 4 | 9.1% |
| 4 | 4 | 9.1% |
| 2 | 3 | 6.8% |
| 5 | 3 | 6.8% |
| a | 3 | 6.8% |
| e | 3 | 6.8% |
| . | 2 | 4.5% |
| , | 2 | 4.5% |
| n | 2 | 4.5% |
| w | 1 | 2.3% |
| Other values (17) | 17 |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2071 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.818195707 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SYNONYM |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | SYNONYM |
| Value | Count | Frequency (%) |
| accepted | 1560511 | |
| synonym | 349850 | 18.2% |
| doubtful | 13961 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 3121022 | |
| E | 3121022 | |
| T | 1574472 | |
| D | 1574472 | |
| A | 1560511 | |
| P | 1560511 | |
| Y | 699700 | 4.7% |
| N | 699700 | 4.7% |
| O | 363811 | 2.4% |
| S | 349850 | 2.3% |
| Other values (5) | 419655 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15044726 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3121022 | |
| E | 3121022 | |
| T | 1574472 | |
| D | 1574472 | |
| A | 1560511 | |
| P | 1560511 | |
| Y | 699700 | 4.7% |
| N | 699700 | 4.7% |
| O | 363811 | 2.4% |
| S | 349850 | 2.3% |
| Other values (5) | 419655 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15044726 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 3121022 | |
| E | 3121022 | |
| T | 1574472 | |
| D | 1574472 | |
| A | 1560511 | |
| P | 1560511 | |
| Y | 699700 | 4.7% |
| N | 699700 | 4.7% |
| O | 363811 | 2.4% |
| S | 349850 | 2.3% |
| Other values (5) | 419655 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15044726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 3121022 | |
| E | 3121022 | |
| T | 1574472 | |
| D | 1574472 | |
| A | 1560511 | |
| P | 1560511 | |
| Y | 699700 | 4.7% |
| N | 699700 | 4.7% |
| O | 363811 | 2.4% |
| S | 349850 | 2.3% |
| Other values (5) | 419655 | 2.8% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926391 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 6482728 |
|---|---|
| 2nd row | 2504455 |
| Value | Count | Frequency (%) |
| 6482728 | 1 | |
| 2504455 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 8 | 2 | |
| 6 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 0 | 1 | 7.1% |
taxonRemarks
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926390 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 16.33333333 |
| Min length | 8 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Hemionchos striatus |
|---|---|
| 2nd row | Nematoda |
| 3rd row | Conspicuum icteridorum |
| Value | Count | Frequency (%) |
| hemionchos | 1 | |
| striatus | 1 | |
| nematoda | 1 | |
| conspicuum | 1 | |
| icteridorum | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 5 | |
| o | 5 | |
| m | 4 | 8.2% |
| s | 4 | 8.2% |
| t | 4 | 8.2% |
| u | 4 | 8.2% |
| c | 3 | 6.1% |
| e | 3 | 6.1% |
| r | 3 | 6.1% |
| a | 3 | 6.1% |
| Other values (8) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44 | |
| Uppercase Letter | 3 | 6.1% |
| Space Separator | 2 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5 | |
| o | 5 | |
| m | 4 | |
| s | 4 | |
| t | 4 | |
| u | 4 | |
| c | 3 | |
| e | 3 | |
| r | 3 | |
| a | 3 | |
| Other values (4) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| H | 1 | |
| N | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47 | |
| Common | 2 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 5 | |
| o | 5 | |
| m | 4 | |
| s | 4 | |
| t | 4 | |
| u | 4 | |
| c | 3 | 6.4% |
| e | 3 | 6.4% |
| r | 3 | 6.4% |
| a | 3 | 6.4% |
| Other values (7) | 9 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 5 | |
| o | 5 | |
| m | 4 | 8.2% |
| s | 4 | 8.2% |
| t | 4 | 8.2% |
| u | 4 | 8.2% |
| c | 3 | 6.1% |
| e | 3 | 6.1% |
| r | 3 | 6.1% |
| a | 3 | 6.1% |
| Other values (8) | 11 |
datasetKey
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 36 |
| Mean length | 36.00000831 |
| Min length | 36 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 1926387 | |
| | 2 | < 0.1% |
| hemionchos | 1 | < 0.1% |
| striatus | 1 | < 0.1% |
| campbell | 1 | < 0.1% |
| beveridge | 1 | < 0.1% |
| 2006 | 1 | < 0.1% |
| conspicuum | 1 | < 0.1% |
| icteridorum | 1 | < 0.1% |
| denton | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 7705551 | |
| a | 7705550 | |
| - | 7705548 | |
| 2 | 5779162 | |
| b | 5779162 | |
| 4 | 5779161 | |
| d | 3852777 | 5.6% |
| 9 | 3852775 | 5.6% |
| 5 | 3852775 | 5.6% |
| 8 | 3852774 | 5.6% |
| Other values (27) | 13484785 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34674974 | |
| Lowercase Letter | 26969478 | |
| Dash Punctuation | 7705548 | 11.1% |
| Space Separator | 10 | < 0.1% |
| Uppercase Letter | 6 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 7705551 | |
| a | 7705550 | |
| b | 5779162 | |
| d | 3852777 | |
| e | 1926394 | 7.1% |
| i | 6 | < 0.1% |
| r | 5 | < 0.1% |
| o | 5 | < 0.1% |
| m | 4 | < 0.1% |
| n | 4 | < 0.1% |
| Other values (9) | 20 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5779162 | |
| 4 | 5779161 | |
| 9 | 3852775 | |
| 5 | 3852775 | |
| 8 | 3852774 | |
| 3 | 3852774 | |
| 1 | 1926389 | 5.6% |
| 0 | 1926389 | 5.6% |
| 6 | 1926388 | 5.6% |
| 7 | 1926387 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| C | 2 | |
| H | 1 | |
| D | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 | |
| & | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7705548 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42380536 | |
| Latin | 26969484 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 7705551 | |
| a | 7705550 | |
| b | 5779162 | |
| d | 3852777 | |
| e | 1926394 | 7.1% |
| i | 6 | < 0.1% |
| r | 5 | < 0.1% |
| o | 5 | < 0.1% |
| m | 4 | < 0.1% |
| n | 4 | < 0.1% |
| Other values (13) | 26 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| - | 7705548 | |
| 2 | 5779162 | |
| 4 | 5779161 | |
| 9 | 3852775 | |
| 5 | 3852775 | |
| 8 | 3852774 | |
| 3 | 3852774 | |
| 1 | 1926389 | 4.5% |
| 0 | 1926389 | 4.5% |
| 6 | 1926388 | 4.5% |
| Other values (4) | 1926401 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69350020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 7705551 | |
| a | 7705550 | |
| - | 7705548 | |
| 2 | 5779162 | |
| b | 5779162 | |
| 4 | 5779161 | |
| d | 3852777 | 5.6% |
| 9 | 3852775 | 5.6% |
| 5 | 3852775 | 5.6% |
| 8 | 3852774 | 5.6% |
| Other values (27) | 13484785 |
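The datasetKey statistics above show one UUID repeated in nearly every record, a maximum length of 46 against a median of 36, and a few free-text tokens (taxon names and authorship strings) in the value counts. One way to locate such rows is to test each value against the canonical UUID layout. The following is a minimal sketch only: the helper name `find_non_uuid` and the sample list are hypothetical, and it assumes the column has already been loaded as strings with missing values represented as `None`.

```python
import re

# Canonical 8-4-4-4-12 hexadecimal UUID layout (36 characters).
UUID_RE = re.compile(
    r"^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$",
    re.IGNORECASE,
)


def find_non_uuid(values):
    """Return (index, value) pairs for non-missing values that are not UUIDs."""
    return [
        (i, v)
        for i, v in enumerate(values)
        if v is not None and not UUID_RE.match(v)
    ]


# Hypothetical sample: the UUID is the one reported above; the last
# entry is a made-up stand-in for a free-text value.
sample = [
    "821cc27a-e3bb-4bc5-ac34-89ada245069d",
    None,
    "free text, not a dataset key",
]
suspect = find_non_uuid(sample)
```

Rows flagged this way are candidates for manual inspection; the check says nothing about why a value is malformed.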
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 2 |
| Mean length | 2.000019207 |
| Min length | 2 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 1926387 | |
| hemionchos | 1 | < 0.1% |
| striatus | 1 | < 0.1% |
| conspicuum | 1 | < 0.1% |
| icteridorum | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 1926387 | |
| S | 1926387 | |
| i | 5 | < 0.1% |
| u | 4 | < 0.1% |
| o | 4 | < 0.1% |
| s | 4 | < 0.1% |
| m | 3 | < 0.1% |
| c | 3 | < 0.1% |
| t | 3 | < 0.1% |
| r | 3 | < 0.1% |
| Other values (9) | 12 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3852776 | |
| Lowercase Letter | 37 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5 | |
| u | 4 | |
| o | 4 | |
| s | 4 | |
| m | 3 | |
| c | 3 | |
| t | 3 | |
| r | 3 | |
| e | 2 | 5.4% |
| n | 2 | 5.4% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1926387 | |
| S | 1926387 | |
| C | 1 | < 0.1% |
| H | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3852813 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 1926387 | |
| S | 1926387 | |
| i | 5 | < 0.1% |
| u | 4 | < 0.1% |
| o | 4 | < 0.1% |
| s | 4 | < 0.1% |
| m | 3 | < 0.1% |
| c | 3 | < 0.1% |
| t | 3 | < 0.1% |
| r | 3 | < 0.1% |
| Other values (8) | 10 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3852815 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 1926387 | |
| S | 1926387 | |
| i | 5 | < 0.1% |
| u | 4 | < 0.1% |
| o | 4 | < 0.1% |
| s | 4 | < 0.1% |
| m | 3 | < 0.1% |
| c | 3 | < 0.1% |
| t | 3 | < 0.1% |
| r | 3 | < 0.1% |
| Other values (9) | 12 | < 0.1% |
lastInterpreted
Text
| Distinct | 209948 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99591152 |
| Min length | 20 |
Unique
| Unique | 9123 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 2024-12-02T13:57:44.311Z |
|---|---|
| 2nd row | 2024-12-02T13:57:20.485Z |
| 3rd row | 2024-12-02T13:57:18.447Z |
| 4th row | 2024-12-02T13:57:45.124Z |
| 5th row | 2024-12-02T13:57:20.489Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:52.889z | 37 | < 0.1% |
| 2024-12-02t13:57:28.783z | 37 | < 0.1% |
| 2024-12-02t13:57:43.700z | 36 | < 0.1% |
| 2024-12-02t13:57:40.815z | 36 | < 0.1% |
| 2024-12-02t13:58:01.714z | 36 | < 0.1% |
| 2024-12-02t13:57:53.093z | 35 | < 0.1% |
| 2024-12-02t13:57:40.927z | 35 | < 0.1% |
| 2024-12-02t13:57:30.406z | 35 | < 0.1% |
| 2024-12-02t13:57:50.671z | 35 | < 0.1% |
| 2024-12-02t13:57:33.269z | 35 | < 0.1% |
| Other values (209938) | 1926030 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| - | 3852774 | |
| : | 3852774 | |
| 4 | 3098095 | 6.7% |
| 5 | 3058765 | 6.6% |
| 3 | 3051121 | 6.6% |
| T | 1926387 | 4.2% |
| Z | 1926387 | 4.2% |
| Other values (5) | 6918983 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32742672 | |
| Other Punctuation | 5777192 | 12.5% |
| Dash Punctuation | 3852774 | 8.3% |
| Uppercase Letter | 3852774 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| 4 | 3098095 | 9.5% |
| 5 | 3058765 | 9.3% |
| 3 | 3051121 | 9.3% |
| 7 | 1479601 | 4.5% |
| 9 | 1231841 | 3.8% |
| 6 | 1162885 | 3.6% |
| 8 | 1120238 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852774 | |
| . | 1924418 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1926387 | |
| Z | 1926387 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3852774 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42372638 | |
| Latin | 3852774 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| - | 3852774 | |
| : | 3852774 | |
| 4 | 3098095 | 7.3% |
| 5 | 3058765 | 7.2% |
| 3 | 3051121 | 7.2% |
| . | 1924418 | 4.5% |
| 7 | 1479601 | 3.5% |
| Other values (3) | 3514964 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 1926387 | |
| Z | 1926387 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46225412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| - | 3852774 | |
| : | 3852774 | |
| 4 | 3098095 | 6.7% |
| 5 | 3058765 | 6.6% |
| 3 | 3051121 | 6.6% |
| T | 1926387 | 4.2% |
| Z | 1926387 | 4.2% |
| Other values (5) | 6918983 |
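The lastInterpreted lengths above range from 20 to 24 characters, and the `.` character occurs less often than `T` or `Z`, which is consistent with some timestamps omitting fractional seconds. The sketch below shows one way to parse both shapes with the Python standard library. It is an illustration under that assumption, not a description of how the report was produced; the helper name `parse_utc` and the 20-character example are hypothetical.

```python
from datetime import datetime, timezone

# Two layouts assumed here: with and without fractional seconds.
FORMATS = ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ")


def parse_utc(text):
    """Parse a 'Z'-suffixed ISO 8601 timestamp into an aware UTC datetime."""
    for fmt in FORMATS:
        try:
            return datetime.strptime(text, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue
    raise ValueError(f"unrecognized timestamp: {text!r}")


with_ms = parse_utc("2024-12-02T13:57:44.311Z")  # first sample row above
without_ms = parse_utc("2024-12-02T13:57:44Z")   # hypothetical 20-character form
```

Raising on unrecognized input, rather than returning a default, keeps any other layouts visible instead of silently coercing them.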
elevation
Text
Missing 
| Distinct | 1093 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 1919570 |
| Missing (%) | 99.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.361424593 |
| Min length | 3 |
Unique
| Unique | 422 |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | 783.0 |
|---|---|
| 2nd row | 15.0 |
| 3rd row | 160.0 |
| 4th row | 4070.0 |
| 5th row | 870.0 |
| Value | Count | Frequency (%) |
| 1981.0 | 616 | 9.0% |
| 160.0 | 207 | 3.0% |
| 350.0 | 169 | 2.5% |
| 348.0 | 125 | 1.8% |
| 164.0 | 123 | 1.8% |
| 149.0 | 117 | 1.7% |
| 309.0 | 116 | 1.7% |
| 388.0 | 86 | 1.3% |
| 988.0 | 82 | 1.2% |
| 1100.0 | 73 | 1.1% |
| Other values (1083) | 5109 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9340 | |
| . | 6821 | |
| 1 | 4970 | |
| 2 | 2421 | 6.6% |
| 8 | 2381 | 6.5% |
| 3 | 2089 | 5.7% |
| 9 | 1985 | 5.4% |
| 4 | 1733 | 4.7% |
| 5 | 1722 | 4.7% |
| 6 | 1618 | 4.4% |
| Other values (4) | 1501 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29754 | |
| Other Punctuation | 6821 | 18.6% |
| Uppercase Letter | 6 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9340 | |
| 1 | 4970 | |
| 2 | 2421 | 8.1% |
| 8 | 2381 | 8.0% |
| 3 | 2089 | 7.0% |
| 9 | 1985 | 6.7% |
| 4 | 1733 | 5.8% |
| 5 | 1722 | 5.8% |
| 6 | 1618 | 5.4% |
| 7 | 1495 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2 | |
| M | 2 | |
| L | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6821 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36575 | |
| Latin | 6 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9340 | |
| . | 6821 | |
| 1 | 4970 | |
| 2 | 2421 | 6.6% |
| 8 | 2381 | 6.5% |
| 3 | 2089 | 5.7% |
| 9 | 1985 | 5.4% |
| 4 | 1733 | 4.7% |
| 5 | 1722 | 4.7% |
| 6 | 1618 | 4.4% |
Latin
| Value | Count | Frequency (%) |
| E | 2 | |
| M | 2 | |
| L | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9340 | |
| . | 6821 | |
| 1 | 4970 | |
| 2 | 2421 | 6.6% |
| 8 | 2381 | 6.5% |
| 3 | 2089 | 5.7% |
| 9 | 1985 | 5.4% |
| 4 | 1733 | 4.7% |
| 5 | 1722 | 4.7% |
| 6 | 1618 | 4.4% |
| Other values (4) | 1501 | 4.1% |
Missing 
| Distinct | 71 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 1922885 |
| Missing (%) | 99.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 3 |
| Mean length | 3.14395667 |
| Min length | 3 |
Unique
| Unique | 32 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 25.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 3089 | |
| 25.0 | 201 | 5.7% |
| 152.5 | 19 | 0.5% |
| 13.0 | 13 | 0.4% |
| 20.0 | 11 | 0.3% |
| 53.0 | 10 | 0.3% |
| 1.5 | 10 | 0.3% |
| 50.0 | 9 | 0.3% |
| 305.0 | 9 | 0.3% |
| 76.0 | 8 | 0.2% |
| Other values (61) | 129 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6593 | |
| . | 3505 | |
| 5 | 365 | 3.3% |
| 2 | 286 | 2.6% |
| 1 | 102 | 0.9% |
| 3 | 56 | 0.5% |
| 7 | 31 | 0.3% |
| 4 | 29 | 0.3% |
| 6 | 20 | 0.2% |
| 8 | 18 | 0.2% |
| Other values (5) | 24 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7512 | |
| Other Punctuation | 3509 | |
| Dash Punctuation | 4 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6593 | |
| 5 | 365 | 4.9% |
| 2 | 286 | 3.8% |
| 1 | 102 | 1.4% |
| 3 | 56 | 0.7% |
| 7 | 31 | 0.4% |
| 4 | 29 | 0.4% |
| 6 | 20 | 0.3% |
| 8 | 18 | 0.2% |
| 9 | 12 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3505 | |
| : | 4 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11025 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6593 | |
| . | 3505 | |
| 5 | 365 | 3.3% |
| 2 | 286 | 2.6% |
| 1 | 102 | 0.9% |
| 3 | 56 | 0.5% |
| 7 | 31 | 0.3% |
| 4 | 29 | 0.3% |
| 6 | 20 | 0.2% |
| 8 | 18 | 0.2% |
| Other values (3) | 20 | 0.2% |
Latin
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11029 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6593 | |
| . | 3505 | |
| 5 | 365 | 3.3% |
| 2 | 286 | 2.6% |
| 1 | 102 | 0.9% |
| 3 | 56 | 0.5% |
| 7 | 31 | 0.3% |
| 4 | 29 | 0.3% |
| 6 | 20 | 0.2% |
| 8 | 18 | 0.2% |
| Other values (5) | 24 | 0.2% |
depth
Text
Missing 
| Distinct | 8763 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 1143682 |
| Missing (%) | 59.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 20 |
| Mean length | 4.480721492 |
| Min length | 3 |
Unique
| Unique | 2354 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 77.0 |
|---|---|
| 2nd row | 225.0 |
| 3rd row | 74.0 |
| 4th row | 265.0 |
| 5th row | 75.0 |
| Value | Count | Frequency (%) |
| 0.5 | 20751 | 2.7% |
| 1.0 | 11235 | 1.4% |
| 84.0 | 9010 | 1.2% |
| 82.0 | 8984 | 1.1% |
| 18.0 | 8775 | 1.1% |
| 15.0 | 8375 | 1.1% |
| 3.0 | 8321 | 1.1% |
| 27.0 | 7674 | 1.0% |
| 55.0 | 7087 | 0.9% |
| 2.0 | 6958 | 0.9% |
| Other values (8753) | 685541 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 874405 | |
| . | 782711 | |
| 1 | 336183 | 9.6% |
| 5 | 309343 | 8.8% |
| 2 | 253724 | 7.2% |
| 3 | 194487 | 5.5% |
| 4 | 177647 | 5.1% |
| 8 | 153099 | 4.4% |
| 6 | 147797 | 4.2% |
| 7 | 141755 | 4.0% |
| Other values (5) | 135959 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2724387 | |
| Other Punctuation | 782715 | 22.3% |
| Dash Punctuation | 4 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 874405 | |
| 1 | 336183 | 12.3% |
| 5 | 309343 | 11.4% |
| 2 | 253724 | 9.3% |
| 3 | 194487 | 7.1% |
| 4 | 177647 | 6.5% |
| 8 | 153099 | 5.6% |
| 6 | 147797 | 5.4% |
| 7 | 141755 | 5.2% |
| 9 | 135947 | 5.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 782711 | |
| : | 4 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3507106 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 874405 | |
| . | 782711 | |
| 1 | 336183 | 9.6% |
| 5 | 309343 | 8.8% |
| 2 | 253724 | 7.2% |
| 3 | 194487 | 5.5% |
| 4 | 177647 | 5.1% |
| 8 | 153099 | 4.4% |
| 6 | 147797 | 4.2% |
| 7 | 141755 | 4.0% |
| Other values (3) | 135955 | 3.9% |
Latin
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3507110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 874405 | |
| . | 782711 | |
| 1 | 336183 | 9.6% |
| 5 | 309343 | 8.8% |
| 2 | 253724 | 7.2% |
| 3 | 194487 | 5.5% |
| 4 | 177647 | 5.1% |
| 8 | 153099 | 4.4% |
| 6 | 147797 | 4.2% |
| 7 | 141755 | 4.0% |
| Other values (5) | 135959 | 3.9% |
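The depth column is profiled as text, and its character tables above include two `T`, two `Z`, four `:` and four `-` characters alongside a maximum length of 24, which is what two timestamp-like values would contribute. Before treating the column as numeric, it can help to separate the values that convert cleanly from those that do not. The sketch below is illustrative only: the helper name `split_numeric` and the sample list are hypothetical, and missing values are assumed to arrive as `None`.

```python
def split_numeric(values):
    """Split non-missing values into parsed floats and rejected originals."""
    numeric, rejected = [], []
    for v in values:
        if v is None:
            continue  # missing values are skipped, not rejected
        try:
            numeric.append(float(v))
        except (TypeError, ValueError):
            rejected.append(v)
    return numeric, rejected


# Hypothetical sample: two depths from the sample rows above, one
# timestamp-shaped stray value, and one missing value.
sample = ["77.0", "225.0", "2024-12-02T13:57:44.311Z", None]
numeric, rejected = split_numeric(sample)
```

Keeping the rejected originals makes it possible to trace them back to their source rows.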
depthAccuracy
Text
Missing 
| Distinct | 1589 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1205339 |
| Missing (%) | 62.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 3 |
| Mean length | 3.277599181 |
| Min length | 3 |
Unique
| Unique | 320 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 175.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 518729 | |
| 0.5 | 27364 | 3.8% |
| 1.0 | 10273 | 1.4% |
| 2.0 | 8610 | 1.2% |
| 1.5 | 8052 | 1.1% |
| 2.5 | 7638 | 1.1% |
| 4.5 | 5897 | 0.8% |
| 5.0 | 5274 | 0.7% |
| 3.0 | 4946 | 0.7% |
| 9.0 | 3978 | 0.6% |
| Other values (1580) | 120294 | 16.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1221509 | |
| . | 721051 | |
| 5 | 141990 | 6.0% |
| 1 | 65416 | 2.8% |
| 9 | 55282 | 2.3% |
| 2 | 51770 | 2.2% |
| 3 | 27061 | 1.1% |
| 4 | 26874 | 1.1% |
| 7 | 19469 | 0.8% |
| 6 | 17914 | 0.8% |
| Other values (18) | 14990 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1642248 | |
| Other Punctuation | 721052 | |
| Lowercase Letter | 23 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| e | 3 | |
| i | 2 | 8.7% |
| l | 2 | 8.7% |
| t | 2 | 8.7% |
| m | 2 | 8.7% |
| o | 1 | 4.3% |
| s | 1 | 4.3% |
| n | 1 | 4.3% |
| u | 1 | 4.3% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1221509 | |
| 5 | 141990 | 8.6% |
| 1 | 65416 | 4.0% |
| 9 | 55282 | 3.4% |
| 2 | 51770 | 3.2% |
| 3 | 27061 | 1.6% |
| 4 | 26874 | 1.6% |
| 7 | 19469 | 1.2% |
| 6 | 17914 | 1.1% |
| 8 | 14963 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 721051 | |
| , | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| A | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2363301 | |
| Latin | 25 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | |
| e | 3 | |
| i | 2 | 8.0% |
| l | 2 | 8.0% |
| t | 2 | 8.0% |
| m | 2 | 8.0% |
| o | 1 | 4.0% |
| N | 1 | 4.0% |
| s | 1 | 4.0% |
| n | 1 | 4.0% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| 0 | 1221509 | |
| . | 721051 | |
| 5 | 141990 | 6.0% |
| 1 | 65416 | 2.8% |
| 9 | 55282 | 2.3% |
| 2 | 51770 | 2.2% |
| 3 | 27061 | 1.1% |
| 4 | 26874 | 1.1% |
| 7 | 19469 | 0.8% |
| 6 | 17914 | 0.8% |
| Other values (3) | 14965 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2363326 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1221509 | |
| . | 721051 | |
| 5 | 141990 | 6.0% |
| 1 | 65416 | 2.8% |
| 9 | 55282 | 2.3% |
| 2 | 51770 | 2.2% |
| 3 | 27061 | 1.1% |
| 4 | 26874 | 1.1% |
| 7 | 19469 | 0.8% |
| 6 | 17914 | 0.8% |
| Other values (18) | 14990 | 0.6% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 603 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 1917545 |
| Missing (%) | 99.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 12.94586347 |
| Min length | 3 |
Unique
| Unique | 205 |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 511.15289545417056 |
| 3rd row | 32.07008492372621 |
| 4th row | 1726.5254814515185 |
| 5th row | 1860.2902638338219 |
| Value | Count | Frequency (%) |
| 0.0 | 2777 | |
| 511.15289545417056 | 887 | 10.0% |
| 365.9456782615661 | 341 | 3.9% |
| 1436.265124532336 | 162 | 1.8% |
| 3843.282664940326 | 125 | 1.4% |
| 3.650579245692265 | 104 | 1.2% |
| 1878.9020459397648 | 83 | 0.9% |
| 1726.5254814515185 | 80 | 0.9% |
| 857.2535535849795 | 75 | 0.8% |
| 1809.5904164098843 | 71 | 0.8% |
| Other values (593) | 4143 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 13488 | |
| 0 | 13325 | |
| 1 | 12344 | |
| 4 | 11004 | |
| 6 | 10191 | |
| 2 | 10054 | |
| 8 | 9236 | |
| 9 | 9120 | |
| 3 | 9109 | |
| . | 8847 | |
| Other values (9) | 7827 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 105688 | |
| Other Punctuation | 8847 | 7.7% |
| Lowercase Letter | 7 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 13488 | |
| 0 | 13325 | |
| 1 | 12344 | |
| 4 | 11004 | |
| 6 | 10191 | |
| 2 | 10054 | |
| 8 | 9236 | |
| 9 | 9120 | |
| 3 | 9109 | |
| 7 | 7817 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| A | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8847 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 114536 | |
| Latin | 9 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 13488 | |
| 0 | 13325 | |
| 1 | 12344 | |
| 4 | 11004 | |
| 6 | 10191 | |
| 2 | 10054 | |
| 8 | 9236 | |
| 9 | 9120 | |
| 3 | 9109 | |
| . | 8847 | |
| Other values (2) | 7818 |
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| E | 1 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114545 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 13488 | |
| 0 | 13325 | |
| 1 | 12344 | |
| 4 | 11004 | |
| 6 | 10191 | |
| 2 | 10054 | |
| 8 | 9236 | |
| 9 | 9120 | |
| 3 | 9109 | |
| . | 8847 | |
| Other values (9) | 7827 |
issue
Text
| Distinct | 402 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 209 |
|---|---|
| Median length | 204 |
| Mean length | 89.0387161 |
| Min length | 8 |
Unique
| Unique | 86 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;CONTINENT_INVALID |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_invalid | 516478 | |
| occurrence_status_inferred_from_individual_count | 418366 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 224592 | |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 212163 | |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country | 195778 | 10.2% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates | 50454 | 2.6% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 36575 | 1.9% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_coordinates | 32128 | 1.7% |
| occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;geodetic_datum_assumed_wgs84;continent_invalid | 27721 | 1.4% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;taxon_match_higherrank | 25845 | 1.3% |
| Other values (392) | 186256 | 9.7% |
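The value table above counts whole combinations of flags, because each cell in issue is a semicolon-delimited list. A minimal sketch of counting individual flags instead is given here; the function name `count_issue_flags` and the sample rows are illustrative assumptions, with the sample strings copied from the Sample table for this variable.

```python
from collections import Counter


def count_issue_flags(values):
    """Count individual flags across semicolon-delimited issue strings."""
    counts = Counter()
    for value in values:
        if not value:
            continue  # skip missing cells (None or empty string)
        for flag in value.split(";"):
            flag = flag.strip().upper()
            if flag:
                counts[flag] += 1
    return counts


# Illustrative input only: three strings from the Sample table plus one missing cell.
sample = [
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;CONTINENT_INVALID",
    None,
]
flag_counts = count_issue_flags(sample)
```

Per-flag counts are usually easier to act on than the 402 distinct combinations reported for this variable.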
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 16496084 | |
| N | 15722593 | 9.2% |
| E | 14675877 | 8.6% |
| I | 14123173 | 8.2% |
| T | 12772362 | 7.4% |
| R | 12496875 | 7.3% |
| D | 11817130 | 6.9% |
| C | 11680988 | 6.8% |
| O | 10988569 | 6.4% |
| U | 10155291 | 5.9% |
| Other values (24) | 40591323 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 150080785 | |
| Connector Punctuation | 16496084 | 9.6% |
| Other Punctuation | 3079317 | 1.8% |
| Decimal Number | 1864072 | 1.1% |
| Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 15722593 | |
| E | 14675877 | |
| I | 14123173 | |
| T | 12772362 | |
| R | 12496875 | |
| D | 11817130 | |
| C | 11680988 | |
| O | 10988569 | 7.3% |
| U | 10155291 | 6.8% |
| A | 7703967 | 5.1% |
| Other values (14) | 27943960 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| e | 1 | |
| m | 1 | |
| t | 1 | |
| o | 1 | |
| d | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 932036 | |
| 4 | 932036 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 16496084 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 3079317 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 150080792 | |
| Common | 21439473 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 15722593 | |
| E | 14675877 | |
| I | 14123173 | |
| T | 12772362 | |
| R | 12496875 | |
| D | 11817130 | |
| C | 11680988 | |
| O | 10988569 | 7.3% |
| U | 10155291 | 6.8% |
| A | 7703967 | 5.1% |
| Other values (20) | 27943967 |
Common
| Value | Count | Frequency (%) |
| _ | 16496084 | |
| ; | 3079317 | 14.4% |
| 8 | 932036 | 4.3% |
| 4 | 932036 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 171520265 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 16496084 | |
| N | 15722593 | 9.2% |
| E | 14675877 | 8.6% |
| I | 14123173 | 8.2% |
| T | 12772362 | 7.4% |
| R | 12496875 | 7.3% |
| D | 11817130 | 6.9% |
| C | 11680988 | 6.8% |
| O | 10988569 | 6.4% |
| U | 10155291 | 5.9% |
| Other values (24) | 40591323 |
mediaType
Text
Missing 
| Distinct | 73 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1683241 |
| Missing (%) | 87.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 1704 |
|---|---|
| Median length | 10 |
| Mean length | 13.26034744 |
| Min length | 5 |
Unique
| Unique | 17 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 220054 | |
| stillimage;stillimage | 12696 | 5.2% |
| stillimage;stillimage;stillimage | 3561 | 1.5% |
| stillimage;stillimage;stillimage;stillimage | 2030 | 0.8% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 1055 | 0.4% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 769 | 0.3% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 533 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 390 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 309 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 213 | 0.1% |
| Other values (63) | 1542 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 630442 | |
| a | 315222 | |
| e | 315222 | |
| S | 315220 | |
| t | 315220 | |
| i | 315220 | |
| I | 315220 | |
| m | 315220 | |
| g | 315220 | |
| ; | 72070 | 2.2% |
| Other values (2) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2521770 | |
| Uppercase Letter | 630440 | 19.6% |
| Other Punctuation | 72070 | 2.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 630442 | |
| a | 315222 | |
| e | 315222 | |
| t | 315220 | |
| i | 315220 | |
| m | 315220 | |
| g | 315220 | |
| f | 2 | < 0.1% |
| s | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 315220 | |
| I | 315220 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 72070 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3152210 | |
| Common | 72070 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 630442 | |
| a | 315222 | |
| e | 315222 | |
| S | 315220 | |
| t | 315220 | |
| i | 315220 | |
| I | 315220 | |
| m | 315220 | |
| g | 315220 | |
| f | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| ; | 72070 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3224280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 630442 | |
| a | 315222 | |
| e | 315222 | |
| S | 315220 | |
| t | 315220 | |
| i | 315220 | |
| I | 315220 | |
| m | 315220 | |
| g | 315220 | |
| ; | 72070 | 2.2% |
| Other values (2) | 4 | < 0.1% |
hasCoordinate
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 4 |
| Mean length | 4.481446144 |
| Min length | 4 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | true |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 999047 | |
| false | 927340 | |
| latin_america | 1 | < 0.1% |
| echinorhynchus | 1 | < 0.1% |
| lageniformis | 1 | < 0.1% |
| ekbaum | 1 | < 0.1% |
| 1938 | 1 | < 0.1% |
| setaria | 1 | < 0.1% |
| labiatopapillosa | 1 | < 0.1% |
| alessandrini | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
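Apart from true and false, the table above lists a handful of tokens (a region name, taxon names, a year) that do not belong in a boolean field. One plausible cause, stated here only as an assumption, is that a few rows were shifted into the wrong columns during parsing. The sketch below separates clean boolean cells from suspect ones so the affected rows can be inspected; `split_boolean_column` and the sample values are hypothetical.

```python
def split_boolean_column(values):
    """Separate clean true/false cells from anything else in a text column."""
    valid = []    # (row index, parsed bool)
    suspect = []  # (row index, raw value) for cells that are not booleans
    for index, value in enumerate(values):
        if value is None:
            continue  # missing cell, nothing to classify
        text = value.strip().lower()
        if text in ("true", "false"):
            valid.append((index, text == "true"))
        else:
            suspect.append((index, value))
    return valid, suspect


# Illustrative input only; not taken from the dataset row order.
sample = ["true", "false", "LATIN_AMERICA", None, "1938"]
valid_rows, suspect_rows = split_boolean_column(sample)
```

Keeping the row index with each suspect value makes it possible to look up the full record and check whether neighbouring columns are shifted as well.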
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1926390 | |
| r | 999054 | |
| t | 999050 | |
| u | 999050 | |
| a | 927351 | |
| l | 927345 | |
| s | 927345 | |
| f | 927341 | |
| i | 9 | < 0.1% |
| (space) | 8 | < 0.1% |
| Other values (33) | 79 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8632965 | |
| Uppercase Letter | 30 | < 0.1% |
| Decimal Number | 12 | < 0.1% |
| Space Separator | 8 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Connector Punctuation | 2 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1926390 | |
| r | 999054 | |
| t | 999050 | |
| u | 999050 | |
| a | 927351 | |
| l | 927345 | |
| s | 927345 | |
| f | 927341 | |
| i | 9 | < 0.1% |
| n | 6 | < 0.1% |
| Other values (10) | 24 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6 | |
| E | 4 | |
| I | 3 | |
| R | 3 | |
| N | 2 | 6.7% |
| T | 2 | 6.7% |
| C | 2 | 6.7% |
| M | 2 | 6.7% |
| S | 2 | 6.7% |
| O | 1 | 3.3% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 4 | |
| 1 | 3 | |
| 3 | 2 | |
| 9 | 2 | |
| 7 | 1 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8 | |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8632995 | |
| Common | 27 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1926390 | |
| r | 999054 | |
| t | 999050 | |
| u | 999050 | |
| a | 927351 | |
| l | 927345 | |
| s | 927345 | |
| f | 927341 | |
| i | 9 | < 0.1% |
| n | 6 | < 0.1% |
| Other values (23) | 54 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 8 | |
| 8 | 4 | |
| 1 | 3 | 11.1% |
| , | 3 | 11.1% |
| 3 | 2 | 7.4% |
| 9 | 2 | 7.4% |
| _ | 2 | 7.4% |
| 7 | 1 | 3.7% |
| ) | 1 | 3.7% |
| ( | 1 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8633022 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1926390 | |
| r | 999054 | |
| t | 999050 | |
| u | 999050 | |
| a | 927351 | |
| l | 927345 | |
| s | 927345 | |
| f | 927341 | |
| i | 9 | < 0.1% |
| (space) | 8 | < 0.1% |
| Other values (33) | 79 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 5 |
| Mean length | 4.98582114 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 1899057 | |
| true | 27330 | 1.4% |
| north_america | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1899057 | |
| l | 1899057 | |
| s | 1899057 | |
| a | 1899057 | |
| t | 27330 | 0.3% |
| r | 27330 | 0.3% |
| u | 27330 | 0.3% |
| A | 4 | < 0.1% |
| R | 4 | < 0.1% |
| Other values (9) | 18 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9604605 | |
| Uppercase Letter | 24 | < 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 4 | |
| I | 2 | |
| E | 2 | |
| M | 2 | |
| O | 2 | |
| H | 2 | |
| T | 2 | |
| N | 2 | |
| C | 2 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1899057 | |
| l | 1899057 | |
| s | 1899057 | |
| a | 1899057 | |
| t | 27330 | 0.3% |
| r | 27330 | 0.3% |
| u | 27330 | 0.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9604629 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1899057 | |
| l | 1899057 | |
| s | 1899057 | |
| a | 1899057 | |
| t | 27330 | 0.3% |
| r | 27330 | 0.3% |
| u | 27330 | 0.3% |
| A | 4 | < 0.1% |
| R | 4 | < 0.1% |
| Other values (8) | 16 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9604631 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1899057 | |
| l | 1899057 | |
| s | 1899057 | |
| a | 1899057 | |
| t | 27330 | 0.3% |
| r | 27330 | 0.3% |
| u | 27330 | 0.3% |
| A | 4 | < 0.1% |
| R | 4 | < 0.1% |
| Other values (9) | 18 | < 0.1% |
taxonKey
Text
| Distinct | 113080 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.459736045 |
| Min length | 1 |
Unique
| Unique | 38722 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | 2237154 |
|---|---|
| 2nd row | 5189992 |
| 3rd row | 2258402 |
| 4th row | 5187825 |
| 5th row | 6104288 |
| Value | Count | Frequency (%) |
| 225 | 23786 | 1.2% |
| 5967481 | 15294 | 0.8% |
| 105 | 11162 | 0.6% |
| 52 | 8679 | 0.5% |
| 7296 | 8105 | 0.4% |
| 637 | 6531 | 0.3% |
| 137 | 6331 | 0.3% |
| 6540 | 4668 | 0.2% |
| 8166676 | 4580 | 0.2% |
| 256 | 4175 | 0.2% |
| Other values (113070) | 1833077 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2383907 | |
| 5 | 1295476 | |
| 1 | 1222710 | |
| 3 | 1175870 | |
| 8 | 1103667 | |
| 7 | 1093695 | |
| 4 | 1090035 | |
| 6 | 1059267 | |
| 9 | 1047606 | |
| 0 | 971722 | |
| Other values (3) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12443955 | |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2383907 | |
| 5 | 1295476 | |
| 1 | 1222710 | |
| 3 | 1175870 | |
| 8 | 1103667 | |
| 7 | 1093695 | |
| 4 | 1090035 | |
| 6 | 1059267 | |
| 9 | 1047606 | |
| 0 | 971722 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| E | 1 | |
| X | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12443955 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2383907 | |
| 5 | 1295476 | |
| 1 | 1222710 | |
| 3 | 1175870 | |
| 8 | 1103667 | |
| 7 | 1093695 | |
| 4 | 1090035 | |
| 6 | 1059267 | |
| 9 | 1047606 | |
| 0 | 971722 |
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| E | 1 | |
| X | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12443958 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2383907 | |
| 5 | 1295476 | |
| 1 | 1222710 | |
| 3 | 1175870 | |
| 8 | 1103667 | |
| 7 | 1093695 | |
| 4 | 1090035 | |
| 6 | 1059267 | |
| 9 | 1047606 | |
| 0 | 971722 | |
| Other values (3) | 3 | < 0.1% |
acceptedTaxonKey
Text
| Distinct | 94525 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 2070 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.457625877 |
| Min length | 1 |
Unique
| Unique | 27026 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 2237081 |
|---|---|
| 2nd row | 5189992 |
| 3rd row | 2258402 |
| 4th row | 5187825 |
| 5th row | 9722403 |
| Value | Count | Frequency (%) |
| 225 | 23786 | 1.2% |
| 5967481 | 15294 | 0.8% |
| 105 | 11162 | 0.6% |
| 52 | 8679 | 0.5% |
| 7296 | 8105 | 0.4% |
| 637 | 6531 | 0.3% |
| 137 | 6505 | 0.3% |
| 6540 | 4668 | 0.2% |
| 255 | 4580 | 0.2% |
| 256 | 4175 | 0.2% |
| Other values (94515) | 1830838 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 | |
| Other values (6) | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12426552 | |
| Lowercase Letter | 5 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 |
Lowercase Letter
| Value | Count | Frequency (%) |
| é | 1 | |
| x | 1 | |
| i | 1 | |
| c | 1 | |
| o | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12426552 | |
| Latin | 6 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 |
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| é | 1 | |
| x | 1 | |
| i | 1 | |
| c | 1 | |
| o | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12426557 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2351717 | |
| 5 | 1313419 | |
| 1 | 1213699 | |
| 3 | 1149846 | |
| 8 | 1103530 | |
| 7 | 1101446 | |
| 4 | 1092660 | |
| 9 | 1069473 | |
| 6 | 1061655 | |
| 0 | 969107 | |
| Other values (5) | 5 | < 0.1% |
None
| Value | Count | Frequency (%) |
| é | 1 |
kingdomKey
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 1 |
| Mean length | 1.000003115 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 1920497 | |
| 4 | 2826 | 0.1% |
| 0 | 2065 | 0.1% |
| 7 | 964 | 0.1% |
| 3 | 35 | < 0.1% |
| mex.2_1 | 1 | < 0.1% |
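Every value in the table above is a small integer except one stray token, and the report types the column as Text. A minimal sketch of converting such key columns to integers while setting aside non-numeric cells is given here; `parse_integer_keys` and the sample list are assumptions made for illustration, and the same idea would apply to the other key columns.

```python
def parse_integer_keys(values):
    """Parse text cells as integer keys; collect cells that are not plain digits."""
    parsed = {}    # row index -> integer key
    rejected = {}  # row index -> raw value that could not be parsed
    for index, value in enumerate(values):
        if value is None:
            continue  # missing cell
        text = value.strip()
        # Accept ASCII digits only, so signs, dots and letters are rejected.
        if text.isascii() and text.isdigit():
            parsed[index] = int(text)
        else:
            rejected[index] = value
    return parsed, rejected


# Illustrative input only.
sample = ["1", "4", "0", "MEX.2_1", None, "7"]
keys, bad = parse_integer_keys(sample)
```

Rejected cells are kept rather than dropped so they can be traced back to the source rows.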
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1920498 | |
| 4 | 2826 | 0.1% |
| 0 | 2065 | 0.1% |
| 7 | 964 | 0.1% |
| 3 | 35 | < 0.1% |
| M | 1 | < 0.1% |
| E | 1 | < 0.1% |
| X | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1926389 | |
| Uppercase Letter | 3 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1920498 | |
| 4 | 2826 | 0.1% |
| 0 | 2065 | 0.1% |
| 7 | 964 | 0.1% |
| 3 | 35 | < 0.1% |
| 2 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| E | 1 | |
| X | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1926391 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1920498 | |
| 4 | 2826 | 0.1% |
| 0 | 2065 | 0.1% |
| 7 | 964 | 0.1% |
| 3 | 35 | < 0.1% |
| . | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| _ | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| E | 1 | |
| X | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1926394 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1920498 | |
| 4 | 2826 | 0.1% |
| 0 | 2065 | 0.1% |
| 7 | 964 | 0.1% |
| 3 | 35 | < 0.1% |
| M | 1 | < 0.1% |
| E | 1 | < 0.1% |
| X | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
phylumKey
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3161 |
| Missing (%) | 0.2% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 2 |
| Mean length | 2.24715167 |
| Min length | 2 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 105 |
|---|---|
| 2nd row | 52 |
| 3rd row | 43 |
| 4th row | 50 |
| 5th row | 52 |
| Value | Count | Frequency (%) |
| 52 | 864192 | |
| 54 | 392999 | |
| 42 | 241615 | 12.6% |
| 43 | 117703 | 6.1% |
| 50 | 91212 | 4.7% |
| 5967481 | 68758 | 3.6% |
| 108 | 45840 | 2.4% |
| 105 | 32733 | 1.7% |
| 44 | 19745 | 1.0% |
| 74 | 10415 | 0.5% |
| Other values (44) | 38022 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 1482086 | |
| 2 | 1107814 | |
| 4 | 872475 | |
| 0 | 178943 | 4.1% |
| 1 | 152025 | 3.5% |
| 3 | 135402 | 3.1% |
| 8 | 125240 | 2.9% |
| 9 | 93678 | 2.2% |
| 7 | 92947 | 2.2% |
| 6 | 81165 | 1.9% |
| Other values (13) | 19 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4321775 | |
| Lowercase Letter | 14 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1482086 | |
| 2 | 1107814 | |
| 4 | 872475 | |
| 0 | 178943 | 4.1% |
| 1 | 152025 | 3.5% |
| 3 | 135402 | 3.1% |
| 8 | 125240 | 2.9% |
| 9 | 93678 | 2.2% |
| 7 | 92947 | 2.2% |
| 6 | 81165 | 1.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 2 | |
| r | 2 | |
| j | 1 | 7.1% |
| l | 1 | 7.1% |
| f | 1 | 7.1% |
| o | 1 | 7.1% |
| n | 1 | 7.1% |
| u | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1 | |
| C | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4321777 | |
| Latin | 17 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 2 | |
| r | 2 | |
| B | 1 | 5.9% |
| j | 1 | 5.9% |
| C | 1 | 5.9% |
| l | 1 | 5.9% |
| f | 1 | 5.9% |
| o | 1 | 5.9% |
| n | 1 | 5.9% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 5 | 1482086 | |
| 2 | 1107814 | |
| 4 | 872475 | |
| 0 | 178943 | 4.1% |
| 1 | 152025 | 3.5% |
| 3 | 135402 | 3.1% |
| 8 | 125240 | 2.9% |
| 9 | 93678 | 2.2% |
| 7 | 92947 | 2.2% |
| 6 | 81165 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4321794 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 1482086 | |
| 2 | 1107814 | |
| 4 | 872475 | |
| 0 | 178943 | 4.1% |
| 1 | 152025 | 3.5% |
| 3 | 135402 | 3.1% |
| 8 | 125240 | 2.9% |
| 9 | 93678 | 2.2% |
| 7 | 92947 | 2.2% |
| 6 | 81165 | 1.9% |
| Other values (13) | 19 | < 0.1% |
classKey
Text
Missing 
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 66158 |
| Missing (%) | 3.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 3.288918873 |
| Min length | 3 |
Unique
| Unique | 10 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 308 |
|---|---|
| 2nd row | 225 |
| 3rd row | 206 |
| 4th row | 350 |
| 5th row | 225 |
| Value | Count | Frequency (%) |
| 225 | 610123 | |
| 229 | 301912 | |
| 256 | 211086 | 11.3% |
| 137 | 207854 | 11.2% |
| 206 | 93050 | 5.0% |
| 11545536 | 46190 | 2.5% |
| 11133537 | 42750 | 2.3% |
| 255 | 30336 | 1.6% |
| 350 | 27087 | 1.5% |
| 214 | 25635 | 1.4% |
| Other values (105) | 264212 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2351939 | |
| 5 | 1212864 | |
| 1 | 577401 | 9.4% |
| 3 | 547057 | 8.9% |
| 6 | 416257 | 6.8% |
| 9 | 358589 | 5.9% |
| 7 | 296942 | 4.9% |
| 4 | 172597 | 2.8% |
| 0 | 166184 | 2.7% |
| 8 | 18326 | 0.3% |
| Other values (5) | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6118156 | |
| Uppercase Letter | 3 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2351939 | |
| 5 | 1212864 | |
| 1 | 577401 | 9.4% |
| 3 | 547057 | 8.9% |
| 6 | 416257 | 6.8% |
| 9 | 358589 | 5.9% |
| 7 | 296942 | 4.9% |
| 4 | 172597 | 2.8% |
| 0 | 166184 | 2.7% |
| 8 | 18326 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| E | 1 | |
| X | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6118159 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2351939 | |
| 5 | 1212864 | |
| 1 | 577401 | 9.4% |
| 3 | 547057 | 8.9% |
| 6 | 416257 | 6.8% |
| 9 | 358589 | 5.9% |
| 7 | 296942 | 4.9% |
| 4 | 172597 | 2.8% |
| 0 | 166184 | 2.7% |
| 8 | 18326 | 0.3% |
| Other values (2) | 3 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| E | 1 | |
| X | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6118162 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2351939 | |
| 5 | 1212864 | |
| 1 | 577401 | 9.4% |
| 3 | 547057 | 8.9% |
| 6 | 416257 | 6.8% |
| 9 | 358589 | 5.9% |
| 7 | 296942 | 4.9% |
| 4 | 172597 | 2.8% |
| 0 | 166184 | 2.7% |
| 8 | 18326 | 0.3% |
| Other values (5) | 6 | < 0.1% |
orderKey
Text
Missing 
| Distinct | 418 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 329533 |
| Missing (%) | 17.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 80 |
|---|---|
| Median length | 71 |
| Mean length | 4.576634771 |
| Min length | 3 |
Unique
| Unique | 28 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1184 |
|---|---|
| 2nd row | 454 |
| 3rd row | 831 |
| 4th row | 9661062 |
| 5th row | 7390893 |
| Value | Count | Frequency (%) |
| 637 | 196384 | 12.3% |
| 982 | 156428 | 9.8% |
| 1456 | 116401 | 7.3% |
| 7390893 | 113553 | 7.1% |
| 1079 | 69439 | 4.3% |
| 714 | 54200 | 3.4% |
| 1231 | 49533 | 3.1% |
| 440 | 35176 | 2.2% |
| 9310756 | 31275 | 2.0% |
| 9529005 | 30439 | 1.9% |
| Other values (419) | 744045 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 1021057 | |
| 1 | 883038 | |
| 3 | 812835 | |
| 7 | 809815 | |
| 4 | 735664 | |
| 0 | 683837 | |
| 6 | 677497 | |
| 8 | 654881 | |
| 2 | 518629 | |
| 5 | 510778 | |
| Other values (29) | 214 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7308031 | |
| Lowercase Letter | 172 | < 0.1% |
| Uppercase Letter | 17 | < 0.1% |
| Space Separator | 13 | < 0.1% |
| Other Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 30 | |
| i | 16 | |
| h | 16 | |
| e | 14 | |
| o | 14 | |
| n | 14 | |
| c | 12 | 7.0% |
| l | 10 | 5.8% |
| r | 9 | 5.2% |
| d | 7 | 4.1% |
| Other values (8) | 30 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1021057 | |
| 1 | 883038 | |
| 3 | 812835 | |
| 7 | 809815 | |
| 4 | 735664 | |
| 0 | 683837 | |
| 6 | 677497 | |
| 8 | 654881 | |
| 2 | 518629 | |
| 5 | 510778 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 4 | |
| A | 4 | |
| S | 2 | |
| E | 2 | |
| M | 1 | 5.9% |
| N | 1 | 5.9% |
| C | 1 | 5.9% |
| O | 1 | 5.9% |
| L | 1 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13 | |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7308056 | |
| Latin | 189 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 30 | |
| i | 16 | 8.5% |
| h | 16 | 8.5% |
| e | 14 | 7.4% |
| o | 14 | 7.4% |
| n | 14 | 7.4% |
| c | 12 | 6.3% |
| l | 10 | 5.3% |
| r | 9 | 4.8% |
| d | 7 | 3.7% |
| Other values (17) | 47 |
Common
| Value | Count | Frequency (%) |
| 9 | 1021057 | |
| 1 | 883038 | |
| 3 | 812835 | |
| 7 | 809815 | |
| 4 | 735664 | |
| 0 | 683837 | |
| 6 | 677497 | |
| 8 | 654881 | |
| 2 | 518629 | |
| 5 | 510778 | |
| Other values (2) | 25 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7308245 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 1021057 | |
| 1 | 883038 | |
| 3 | 812835 | |
| 7 | 809815 | |
| 4 | 735664 | |
| 0 | 683837 | |
| 6 | 677497 | |
| 8 | 654881 | |
| 2 | 518629 | |
| 5 | 510778 | |
| Other values (29) | 214 | < 0.1% |
familyKey
Text
Missing 
| Distinct | 3525 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 144485 |
| Missing (%) | 7.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.434517382 |
| Min length | 4 |
Unique
| Unique | 272 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 4305937 |
|---|---|
| 2nd row | 2850 |
| 3rd row | 3362 |
| 4th row | 3249433 |
| 5th row | 2675 |
| Value | Count | Frequency (%) |
| 4479 | 28956 | 1.6% |
| 6779 | 28425 | 1.6% |
| 3461 | 26787 | 1.5% |
| 2304120 | 22783 | 1.3% |
| 3445 | 18640 | 1.0% |
| 2675 | 16831 | 0.9% |
| 6760 | 16777 | 0.9% |
| 3595 | 15856 | 0.9% |
| 3588 | 14115 | 0.8% |
| 3472 | 12961 | 0.7% |
| Other values (3515) | 1579777 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 1049135 | |
| 4 | 928679 | |
| 2 | 887515 | |
| 5 | 878476 | |
| 7 | 843755 | |
| 8 | 810631 | |
| 3 | 797950 | |
| 9 | 648125 | |
| 0 | 542346 | |
| 1 | 515266 | |
| Other values (6) | 24 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7901878 | |
| Lowercase Letter | 21 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1049135 | |
| 4 | 928679 | |
| 2 | 887515 | |
| 5 | 878476 | |
| 7 | 843755 | |
| 8 | 810631 | |
| 3 | 797950 | |
| 9 | 648125 | |
| 0 | 542346 | |
| 1 | 515266 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6 | |
| a | 6 | |
| n | 3 | |
| m | 3 | |
| l | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7901878 | |
| Latin | 24 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 1049135 | |
| 4 | 928679 | |
| 2 | 887515 | |
| 5 | 878476 | |
| 7 | 843755 | |
| 8 | 810631 | |
| 3 | 797950 | |
| 9 | 648125 | |
| 0 | 542346 | |
| 1 | 515266 |
Latin
| Value | Count | Frequency (%) |
| i | 6 | |
| a | 6 | |
| A | 3 | |
| n | 3 | |
| m | 3 | |
| l | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7901902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 1049135 | |
| 4 | 928679 | |
| 2 | 887515 | |
| 5 | 878476 | |
| 7 | 843755 | |
| 8 | 810631 | |
| 3 | 797950 | |
| 9 | 648125 | |
| 0 | 542346 | |
| 1 | 515266 | |
| Other values (6) | 24 | < 0.1% |
genusKey
Text
Missing 
| Distinct | 20902 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 358041 |
| Missing (%) | 18.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 7.014656149 |
| Min length | 7 |
Unique
| Unique | 3190 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2237081 |
|---|---|
| 2nd row | 9798512 |
| 3rd row | 2258400 |
| 4th row | 2275832 |
| 5th row | 4628849 |
| Value | Count | Frequency (%) |
| 9819702 | 22884 | 1.5% |
| 8179898 | 8956 | 0.6% |
| 2227317 | 8948 | 0.6% |
| 4646327 | 8189 | 0.5% |
| 2227127 | 8096 | 0.5% |
| 2318625 | 5223 | 0.3% |
| 5189970 | 4536 | 0.3% |
| 2302962 | 4534 | 0.3% |
| 2224189 | 4234 | 0.3% |
| 2301998 | 4085 | 0.3% |
| Other values (20892) | 1488667 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2691752 | |
| 3 | 1078641 | |
| 4 | 1008012 | 9.2% |
| 1 | 1001812 | 9.1% |
| 9 | 940816 | 8.6% |
| 0 | 923361 | 8.4% |
| 8 | 919570 | 8.4% |
| 7 | 830784 | 7.6% |
| 6 | 809158 | 7.4% |
| 5 | 797507 | 7.2% |
| Other values (17) | 37 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11001413 | |
| Lowercase Letter | 34 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| e | 4 | |
| h | 4 | |
| t | 4 | |
| l | 3 | |
| m | 2 | 5.9% |
| n | 2 | 5.9% |
| c | 2 | 5.9% |
| o | 2 | 5.9% |
| y | 1 | 2.9% |
| Other values (4) | 4 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2691752 | |
| 3 | 1078641 | |
| 4 | 1008012 | 9.2% |
| 1 | 1001812 | 9.1% |
| 9 | 940816 | 8.6% |
| 0 | 923361 | 8.4% |
| 8 | 919570 | 8.4% |
| 7 | 830784 | 7.6% |
| 6 | 809158 | 7.4% |
| 5 | 797507 | 7.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11001413 | |
| Latin | 37 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| e | 4 | |
| h | 4 | |
| t | 4 | |
| l | 3 | |
| m | 2 | 5.4% |
| n | 2 | 5.4% |
| c | 2 | 5.4% |
| o | 2 | 5.4% |
| y | 1 | 2.7% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 2 | 2691752 | |
| 3 | 1078641 | |
| 4 | 1008012 | 9.2% |
| 1 | 1001812 | 9.1% |
| 9 | 940816 | 8.6% |
| 0 | 923361 | 8.4% |
| 8 | 919570 | 8.4% |
| 7 | 830784 | 7.6% |
| 6 | 809158 | 7.4% |
| 5 | 797507 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11001450 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2691752 | |
| 3 | 1078641 | |
| 4 | 1008012 | 9.2% |
| 1 | 1001812 | 9.1% |
| 9 | 940816 | 8.6% |
| 0 | 923361 | 8.4% |
| 8 | 919570 | 8.4% |
| 7 | 830784 | 7.6% |
| 6 | 809158 | 7.4% |
| 5 | 797507 | 7.2% |
| Other values (17) | 37 | < 0.1% |
subgenusKey
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 1926388 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 11 |
| Mean length | 8.6 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 60.0% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | Palaeacanthocephala |
| 3rd row | Chromadorea |
| 4th row | Monogenea |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 2 | |
| palaeacanthocephala | 1 | |
| chromadorea | 1 | |
| monogenea | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 5 | |
| e | 5 | |
| h | 3 | 7.0% |
| n | 3 | 7.0% |
| r | 2 | 4.7% |
| E | 2 | 4.7% |
| N | 2 | 4.7% |
| c | 2 | 4.7% |
| l | 2 | 4.7% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36 | |
| Uppercase Letter | 7 | 16.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 5 | |
| e | 5 | |
| h | 3 | 8.3% |
| n | 3 | 8.3% |
| r | 2 | 5.6% |
| c | 2 | 5.6% |
| l | 2 | 5.6% |
| t | 1 | 2.8% |
| p | 1 | 2.8% |
| Other values (3) | 3 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2 | |
| N | 2 | |
| C | 1 | |
| P | 1 | |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 5 | |
| e | 5 | |
| h | 3 | 7.0% |
| n | 3 | 7.0% |
| r | 2 | 4.7% |
| E | 2 | 4.7% |
| N | 2 | 4.7% |
| c | 2 | 4.7% |
| l | 2 | 4.7% |
| Other values (8) | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 5 | |
| e | 5 | |
| h | 3 | 7.0% |
| n | 3 | 7.0% |
| r | 2 | 4.7% |
| E | 2 | 4.7% |
| N | 2 | 4.7% |
| c | 2 | 4.7% |
| l | 2 | 4.7% |
| Other values (8) | 8 |
speciesKey
Text
Missing 
| Distinct | 81482 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 626819 |
| Missing (%) | 32.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 7.042964079 |
| Min length | 7 |
Unique
| Unique | 23449 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 5189992 |
|---|---|
| 2nd row | 2258402 |
| 3rd row | 5187825 |
| 4th row | 9722403 |
| 5th row | 2274145 |
| Value | Count | Frequency (%) |
| 2318104 | 2020 | 0.2% |
| 5728138 | 1518 | 0.1% |
| 7823183 | 1512 | 0.1% |
| 2321421 | 1479 | 0.1% |
| 9029731 | 1415 | 0.1% |
| 2227405 | 1414 | 0.1% |
| 2227381 | 1402 | 0.1% |
| 5724968 | 1368 | 0.1% |
| 2509463 | 1354 | 0.1% |
| 8971201 | 1324 | 0.1% |
| Other values (81472) | 1284768 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1727288 | |
| 5 | 965514 | |
| 1 | 941828 | |
| 3 | 831165 | |
| 8 | 829394 | |
| 7 | 811641 | |
| 9 | 808573 | |
| 4 | 781033 | |
| 6 | 737014 | |
| 0 | 719364 | |
| Other values (18) | 39 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9152814 | |
| Lowercase Letter | 36 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| d | 4 | |
| h | 4 | |
| t | 3 | |
| o | 3 | |
| y | 2 | 5.6% |
| n | 2 | 5.6% |
| c | 2 | 5.6% |
| r | 1 | 2.8% |
| Other values (5) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1727288 | |
| 5 | 965514 | |
| 1 | 941828 | |
| 3 | 831165 | |
| 8 | 829394 | |
| 7 | 811641 | |
| 9 | 808573 | |
| 4 | 781033 | |
| 6 | 737014 | |
| 0 | 719364 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 | |
| E | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9152814 | |
| Latin | 39 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| d | 4 | |
| h | 4 | |
| t | 3 | 7.7% |
| o | 3 | 7.7% |
| y | 2 | 5.1% |
| n | 2 | 5.1% |
| c | 2 | 5.1% |
| r | 1 | 2.6% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| 2 | 1727288 | |
| 5 | 965514 | |
| 1 | 941828 | |
| 3 | 831165 | |
| 8 | 829394 | |
| 7 | 811641 | |
| 9 | 808573 | |
| 4 | 781033 | |
| 6 | 737014 | |
| 0 | 719364 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9152853 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1727288 | |
| 5 | 965514 | |
| 1 | 941828 | |
| 3 | 831165 | |
| 8 | 829394 | |
| 7 | 811641 | |
| 9 | 808573 | |
| 4 | 781033 | |
| 6 | 737014 | |
| 0 | 719364 | |
| Other values (18) | 39 | < 0.1% |
species
Text
Missing 
| Distinct | 81449 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 626822 |
| Missing (%) | 32.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 36 |
| Mean length | 18.98173243 |
| Min length | 7 |
Unique
| Unique | 23438 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | Bulla striata |
|---|---|
| 2nd row | Stylopathes columnaris |
| 3rd row | Ophiothrix suensonii |
| 4th row | Naria labrolineata |
| 5th row | Lysasterias heteractis |
| Value | Count | Frequency (%) |
| conus | 21648 | 0.8% |
| cerithium | 8891 | 0.3% |
| cambarus | 8740 | 0.3% |
| faxonius | 8187 | 0.3% |
| procambarus | 8031 | 0.3% |
| gracilis | 6079 | 0.2% |
| aricidea | 4891 | 0.2% |
| nassarius | 4086 | 0.2% |
| pagurus | 3943 | 0.2% |
| oliva | 3823 | 0.1% |
| Other values (55326) | 2520823 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3034205 | |
| i | 2321115 | 9.4% |
| s | 1756467 | 7.1% |
| e | 1634157 | 6.6% |
| r | 1566246 | 6.3% |
| o | 1518962 | 6.2% |
| l | 1442638 | 5.8% |
| t | 1302012 | 5.3% |
| (space) | 1299571 | 5.3% |
| u | 1297622 | 5.3% |
| Other values (44) | 7495114 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22068966 | |
| Space Separator | 1299571 | 5.3% |
| Uppercase Letter | 1299571 | 5.3% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3034205 | |
| i | 2321115 | |
| s | 1756467 | 8.0% |
| e | 1634157 | 7.4% |
| r | 1566246 | 7.1% |
| o | 1518962 | 6.9% |
| l | 1442638 | 6.5% |
| t | 1302012 | 5.9% |
| u | 1297622 | 5.9% |
| n | 1297510 | 5.9% |
| Other values (16) | 4898032 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 185835 | |
| C | 181674 | |
| A | 133784 | |
| S | 96420 | 7.4% |
| M | 86379 | 6.6% |
| L | 83596 | 6.4% |
| E | 71837 | 5.5% |
| T | 65730 | 5.1% |
| N | 53880 | 4.1% |
| O | 53677 | 4.1% |
| Other values (16) | 286759 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1299571 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23368537 | |
| Common | 1299572 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3034205 | |
| i | 2321115 | 9.9% |
| s | 1756467 | 7.5% |
| e | 1634157 | 7.0% |
| r | 1566246 | 6.7% |
| o | 1518962 | 6.5% |
| l | 1442638 | 6.2% |
| t | 1302012 | 5.6% |
| u | 1297622 | 5.6% |
| n | 1297510 | 5.6% |
| Other values (42) | 6197603 |
Common
| Value | Count | Frequency (%) |
| (space) | 1299571 | |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24668109 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3034205 | |
| i | 2321115 | 9.4% |
| s | 1756467 | 7.1% |
| e | 1634157 | 6.6% |
| r | 1566246 | 6.3% |
| o | 1518962 | 6.2% |
| l | 1442638 | 5.8% |
| t | 1302012 | 5.3% |
| (space) | 1299571 | 5.3% |
| u | 1297622 | 5.3% |
| Other values (44) | 7495114 |
| Distinct | 94525 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 2067 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 188 |
|---|---|
| Median length | 120 |
| Mean length | 29.47398518 |
| Min length | 6 |
Unique
| Unique | 27025 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | Sycon Risso, 1827 |
|---|---|
| 2nd row | Bulla striata Bruguière, 1792 |
| 3rd row | Stylopathes columnaris (Duchassaing, 1870) |
| 4th row | Ophiothrix suensonii Lütken, 1856 |
| 5th row | Naria labrolineata (Gaskoin, 1849) |
| Value | Count | Frequency (%) |
| (empty) | 137132 | 2.0% |
| linnaeus | 102227 | 1.5% |
| 1758 | 86436 | 1.3% |
| say | 52030 | 0.8% |
| lamarck | 41218 | 0.6% |
| dall | 26280 | 0.4% |
| 1791 | 25378 | 0.4% |
| gmelin | 24581 | 0.4% |
| gastropoda | 23786 | 0.4% |
| conus | 22951 | 0.3% |
| Other values (67808) | 6231685 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4975492 | 8.8% |
| (space) | 4849378 | 8.6% |
| i | 3756798 | 6.6% |
| e | 3416706 | 6.0% |
| r | 2838443 | 5.0% |
| s | 2680069 | 4.7% |
| o | 2507220 | 4.4% |
| l | 2493288 | 4.4% |
| n | 2462018 | 4.3% |
| t | 1946947 | 3.4% |
| Other values (105) | 24791197 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37642616 | |
| Decimal Number | 6282608 | 11.1% |
| Space Separator | 4849378 | 8.6% |
| Uppercase Letter | 4140323 | 7.3% |
| Other Punctuation | 2120049 | 3.7% |
| Close Punctuation | 828480 | 1.5% |
| Open Punctuation | 828480 | 1.5% |
| Dash Punctuation | 25622 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4975492 | |
| i | 3756798 | |
| e | 3416706 | 9.1% |
| r | 2838443 | 7.5% |
| s | 2680069 | 7.1% |
| o | 2507220 | 6.7% |
| l | 2493288 | 6.6% |
| n | 2462018 | 6.5% |
| t | 1946947 | 5.2% |
| u | 1878146 | 5.0% |
| Other values (50) | 8687489 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 395646 | 9.6% |
| P | 384763 | 9.3% |
| C | 374392 | 9.0% |
| L | 361606 | 8.7% |
| A | 294332 | 7.1% |
| M | 289981 | 7.0% |
| B | 240607 | 5.8% |
| H | 228739 | 5.5% |
| G | 219867 | 5.3% |
| E | 172880 | 4.2% |
| Other values (27) | 1177510 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1879993 | |
| 8 | 1317965 | |
| 9 | 685308 | 10.9% |
| 7 | 543716 | 8.7% |
| 5 | 369738 | 5.9% |
| 2 | 318125 | 5.1% |
| 6 | 311114 | 5.0% |
| 0 | 300914 | 4.8% |
| 4 | 287040 | 4.6% |
| 3 | 268695 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1587604 | |
| . | 385539 | 18.2% |
| & | 137136 | 6.5% |
| ' | 9770 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4849378 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 828480 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 828480 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41782939 | |
| Common | 14934617 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4975492 | 11.9% |
| i | 3756798 | 9.0% |
| e | 3416706 | 8.2% |
| r | 2838443 | 6.8% |
| s | 2680069 | 6.4% |
| o | 2507220 | 6.0% |
| l | 2493288 | 6.0% |
| n | 2462018 | 5.9% |
| t | 1946947 | 4.7% |
| u | 1878146 | 4.5% |
| Other values (87) | 12827812 |
Common
| Value | Count | Frequency (%) |
| (space) | 4849378 | |
| 1 | 1879993 | 12.6% |
| , | 1587604 | 10.6% |
| 8 | 1317965 | 8.8% |
| ) | 828480 | 5.5% |
| ( | 828480 | 5.5% |
| 9 | 685308 | 4.6% |
| 7 | 543716 | 3.6% |
| . | 385539 | 2.6% |
| 5 | 369738 | 2.5% |
| Other values (8) | 1658416 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56587562 | |
| None | 129994 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4975492 | 8.8% |
| (space) | 4849378 | 8.6% |
| i | 3756798 | 6.6% |
| e | 3416706 | 6.0% |
| r | 2838443 | 5.0% |
| s | 2680069 | 4.7% |
| o | 2507220 | 4.4% |
| l | 2493288 | 4.4% |
| n | 2462018 | 4.4% |
| t | 1946947 | 3.4% |
| Other values (60) | 24661203 |
None
| Value | Count | Frequency (%) |
| ü | 34699 | |
| ö | 25701 | |
| è | 22198 | |
| é | 21881 | |
| ø | 9165 | 7.1% |
| å | 5072 | 3.9% |
| Ö | 4790 | 3.7% |
| á | 1982 | 1.5% |
| ä | 1161 | 0.9% |
| í | 1065 | 0.8% |
| Other values (35) | 2280 | 1.8% |
Missing 
| Distinct | 133993 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 353775 |
| Missing (%) | 18.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 85 |
|---|---|
| Median length | 59 |
| Mean length | 19.44688666 |
| Min length | 4 |
Unique
| Unique | 51619 |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Scypha sp. |
|---|---|
| 2nd row | Bulla striata |
| 3rd row | Stylopathes columnaris |
| 4th row | Ophiothrix suensonii |
| 5th row | Cypraea labrolineata |
| Value | Count | Frequency (%) |
| sp | 198063 | 6.0% |
| conus | 24328 | 0.7% |
| cypraea | 15395 | 0.5% |
| cambarus | 12003 | 0.4% |
| cerithium | 9397 | 0.3% |
| orconectes | 8683 | 0.3% |
| procambarus | 8141 | 0.2% |
| nassarius | 6728 | 0.2% |
| gracilis | 6632 | 0.2% |
| terebra | 5168 | 0.2% |
| Other values (70829) | 3025211 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3610431 | 11.8% |
| i | 2750408 | 9.0% |
| s | 2277504 | 7.4% |
| e | 1954344 | 6.4% |
| r | 1901340 | 6.2% |
| o | 1840596 | 6.0% |
| (space) | 1747131 | 5.7% |
| l | 1714269 | 5.6% |
| n | 1541700 | 5.0% |
| t | 1537214 | 5.0% |
| Other values (68) | 9707587 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26724880 | |
| Space Separator | 1747131 | 5.7% |
| Uppercase Letter | 1685395 | 5.5% |
| Other Punctuation | 198792 | 0.7% |
| Open Punctuation | 112865 | 0.4% |
| Close Punctuation | 112865 | 0.4% |
| Decimal Number | 468 | < 0.1% |
| Dash Punctuation | 110 | < 0.1% |
| Math Symbol | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3610431 | |
| i | 2750408 | |
| s | 2277504 | 8.5% |
| e | 1954344 | 7.3% |
| r | 1901340 | 7.1% |
| o | 1840596 | 6.9% |
| l | 1714269 | 6.4% |
| n | 1541700 | 5.8% |
| t | 1537214 | 5.8% |
| u | 1522243 | 5.7% |
| Other values (18) | 6074831 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 246724 | |
| P | 236762 | |
| A | 165937 | |
| S | 135252 | 8.0% |
| M | 109812 | 6.5% |
| T | 106747 | 6.3% |
| L | 97706 | 5.8% |
| E | 85779 | 5.1% |
| O | 78988 | 4.7% |
| N | 66257 | 3.9% |
| Other values (16) | 355431 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 156 | |
| 8 | 110 | |
| 4 | 58 | 12.4% |
| 9 | 38 | 8.1% |
| 6 | 27 | 5.8% |
| 2 | 25 | 5.3% |
| 5 | 19 | 4.1% |
| 7 | 16 | 3.4% |
| 0 | 12 | 2.6% |
| 3 | 7 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 198575 | |
| , | 107 | 0.1% |
| " | 60 | < 0.1% |
| / | 29 | < 0.1% |
| ' | 15 | < 0.1% |
| & | 3 | < 0.1% |
| ? | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 112864 | |
| [ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 112864 | |
| ] | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1747131 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28410275 | |
| Common | 2172249 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3610431 | |
| i | 2750408 | 9.7% |
| s | 2277504 | 8.0% |
| e | 1954344 | 6.9% |
| r | 1901340 | 6.7% |
| o | 1840596 | 6.5% |
| l | 1714269 | 6.0% |
| n | 1541700 | 5.4% |
| t | 1537214 | 5.4% |
| u | 1522243 | 5.4% |
| Other values (44) | 7760226 |
Common
| Value | Count | Frequency (%) |
| (space) | 1747131 | |
| . | 198575 | 9.1% |
| ( | 112864 | 5.2% |
| ) | 112864 | 5.2% |
| 1 | 156 | < 0.1% |
| 8 | 110 | < 0.1% |
| - | 110 | < 0.1% |
| , | 107 | < 0.1% |
| " | 60 | < 0.1% |
| 4 | 58 | < 0.1% |
| Other values (14) | 214 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30582508 | |
| None | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3610431 | 11.8% |
| i | 2750408 | 9.0% |
| s | 2277504 | 7.4% |
| e | 1954344 | 6.4% |
| r | 1901340 | 6.2% |
| o | 1840596 | 6.0% |
| (space) | 1747131 | 5.7% |
| l | 1714269 | 5.6% |
| n | 1541700 | 5.0% |
| t | 1537214 | 5.0% |
| Other values (66) | 9707571 |
None
| Value | Count | Frequency (%) |
| ü | 15 | |
| æ | 1 | 6.2% |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 1926387 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1926387 | |
| M | 1926387 | |
| L | 1926387 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5779161 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1926387 | |
| M | 1926387 | |
| L | 1926387 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5779161 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1926387 | |
| M | 1926387 | |
| L | 1926387 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5779161 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1926387 | |
| M | 1926387 | |
| L | 1926387 |
lastParsed
Text
| Distinct | 209952 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99588194 |
| Min length | 7 |
Unique
| Unique | 9127 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 2024-12-02T13:57:44.311Z |
|---|---|
| 2nd row | 2024-12-02T13:57:20.485Z |
| 3rd row | 2024-12-02T13:57:18.447Z |
| 4th row | 2024-12-02T13:57:45.124Z |
| 5th row | 2024-12-02T13:57:20.489Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:28.783z | 37 | < 0.1% |
| 2024-12-02t13:57:52.889z | 37 | < 0.1% |
| 2024-12-02t13:57:43.700z | 36 | < 0.1% |
| 2024-12-02t13:58:01.714z | 36 | < 0.1% |
| 2024-12-02t13:57:40.815z | 36 | < 0.1% |
| 2024-12-02t13:57:30.406z | 35 | < 0.1% |
| 2024-12-02t13:57:53.093z | 35 | < 0.1% |
| 2024-12-02t13:57:41.994z | 35 | < 0.1% |
| 2024-12-02t13:57:40.927z | 35 | < 0.1% |
| 2024-12-02t13:57:35.574z | 35 | < 0.1% |
| Other values (209942) | 1926034 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| : | 3852774 | |
| - | 3852774 | |
| 4 | 3098095 | 6.7% |
| 5 | 3058765 | 6.6% |
| 3 | 3051121 | 6.6% |
| T | 1926388 | 4.2% |
| Z | 1926387 | 4.2% |
| Other values (24) | 6919021 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32742672 | |
| Other Punctuation | 5777192 | 12.5% |
| Uppercase Letter | 3852785 | 8.3% |
| Dash Punctuation | 3852774 | 8.3% |
| Lowercase Letter | 28 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 4 | |
| h | 4 | |
| n | 3 | |
| i | 2 | |
| c | 2 | |
| y | 2 | |
| u | 2 | |
| o | 1 | 3.6% |
| s | 1 | 3.6% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| 4 | 3098095 | 9.5% |
| 5 | 3058765 | 9.3% |
| 3 | 3051121 | 9.3% |
| 7 | 1479601 | 4.5% |
| 9 | 1231841 | 3.8% |
| 6 | 1162885 | 3.6% |
| 8 | 1120238 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1926388 | |
| Z | 1926387 | |
| E | 3 | < 0.1% |
| S | 2 | < 0.1% |
| C | 2 | < 0.1% |
| A | 1 | < 0.1% |
| P | 1 | < 0.1% |
| D | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852774 | |
| . | 1924418 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3852774 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42372638 | |
| Latin | 3852813 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 1926388 | |
| Z | 1926387 | |
| a | 4 | < 0.1% |
| r | 4 | < 0.1% |
| h | 4 | < 0.1% |
| n | 3 | < 0.1% |
| E | 3 | < 0.1% |
| i | 2 | < 0.1% |
| c | 2 | < 0.1% |
| y | 2 | < 0.1% |
| Other values (11) | 14 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| : | 3852774 | |
| - | 3852774 | |
| 4 | 3098095 | 7.3% |
| 5 | 3058765 | 7.2% |
| 3 | 3051121 | 7.2% |
| . | 1924418 | 4.5% |
| 7 | 1479601 | 3.5% |
| Other values (3) | 3514964 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46225451 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 8796773 | |
| 0 | 4884695 | |
| 1 | 4858658 | |
| : | 3852774 | |
| - | 3852774 | |
| 4 | 3098095 | 6.7% |
| 5 | 3058765 | 6.6% |
| 3 | 3051121 | 6.6% |
| T | 1926388 | 4.2% |
| Z | 1926387 | 4.2% |
| Other values (24) | 6919021 |
lastCrawled
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99997872 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 1926387 | |
| echinorhynchus | 1 | < 0.1% |
| setaria | 1 | < 0.1% |
| sphyranura | 1 | < 0.1% |
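Three of the four distinct values above are not timestamps, which suggests that a few records have shifted columns. The following is a minimal sketch, not part of the report, for flagging such entries. It assumes the values are available as plain strings and that valid entries follow the form shown in the Sample table; the names `ISO_UTC` and `non_timestamp_values` are hypothetical.

```python
import re

# Assumed pattern for the timestamps shown in the Sample table,
# e.g. 2024-12-02T11:48:23.416Z; the fractional seconds are treated as optional.
ISO_UTC = re.compile(r"\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?Z")


def non_timestamp_values(values):
    """Return the values that do not fully match the assumed timestamp pattern."""
    return [v for v in values if ISO_UTC.fullmatch(v) is None]


# Illustrative input: the distinct values listed above. The value table appears
# to show lowercased text, so the timestamp is written here in its original case.
distinct_values = [
    "2024-12-02T11:48:23.416Z",
    "echinorhynchus",
    "setaria",
    "sphyranura",
]

suspect = non_timestamp_values(distinct_values)
print(suspect)
```

The same check could be applied to lastParsed, whose character tables also contain a small number of letters other than T and Z.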
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 9631935 | |
| 1 | 7705548 | |
| 4 | 5779161 | |
| - | 3852774 | 8.3% |
| 0 | 3852774 | 8.3% |
| : | 3852774 | 8.3% |
| 6 | 1926387 | 4.2% |
| Z | 1926387 | 4.2% |
| . | 1926387 | 4.2% |
| 3 | 1926387 | 4.2% |
| Other values (17) | 3852805 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32748579 | |
| Other Punctuation | 5779161 | 12.5% |
| Uppercase Letter | 3852777 | 8.3% |
| Dash Punctuation | 3852774 | 8.3% |
| Lowercase Letter | 28 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| h | 4 | |
| r | 4 | |
| n | 3 | |
| y | 2 | |
| u | 2 | |
| c | 2 | |
| i | 2 | |
| o | 1 | 3.6% |
| s | 1 | 3.6% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9631935 | |
| 1 | 7705548 | |
| 4 | 5779161 | |
| 0 | 3852774 | 11.8% |
| 6 | 1926387 | 5.9% |
| 3 | 1926387 | 5.9% |
| 8 | 1926387 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 1926387 | |
| T | 1926387 | |
| S | 2 | < 0.1% |
| E | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852774 | |
| . | 1926387 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3852774 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42380514 | |
| Latin | 3852805 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Z | 1926387 | |
| T | 1926387 | |
| a | 4 | < 0.1% |
| h | 4 | < 0.1% |
| r | 4 | < 0.1% |
| n | 3 | < 0.1% |
| y | 2 | < 0.1% |
| S | 2 | < 0.1% |
| u | 2 | < 0.1% |
| c | 2 | < 0.1% |
| Other values (7) | 8 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 9631935 | |
| 1 | 7705548 | |
| 4 | 5779161 | |
| - | 3852774 | 9.1% |
| 0 | 3852774 | 9.1% |
| : | 3852774 | 9.1% |
| 6 | 1926387 | 4.5% |
| . | 1926387 | 4.5% |
| 3 | 1926387 | 4.5% |
| 8 | 1926387 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46233319 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 9631935 | |
| 1 | 7705548 | |
| 4 | 5779161 | |
| - | 3852774 | 8.3% |
| 0 | 3852774 | 8.3% |
| : | 3852774 | 8.3% |
| 6 | 1926387 | 4.2% |
| Z | 1926387 | 4.2% |
| . | 1926387 | 4.2% |
| 3 | 1926387 | 4.2% |
| Other values (17) | 3852805 |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110144 |
| Missing (%) | 5.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.47822738 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 947669 | |
| false | 868580 |
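The Frequency (%) cells in this table did not survive extraction. As a rough cross-check, the sketch below recomputes the shares among non-missing values and verifies that the two counts add up to the number of observations minus the missing cells. The counts are copied from this report; the variable names are illustrative, and taking shares over non-missing values only is an assumption about how the report defines frequency.

```python
# Counts copied from the report; n_observations is the figure in the dataset overview.
n_observations = 1926393
n_missing = 110144
counts = {"true": 947669, "false": 868580}

n_present = sum(counts.values())

# The non-missing values should account for every observation that is not missing.
assert n_present == n_observations - n_missing

# Share of each value among non-missing rows, rounded to one decimal place.
shares = {value: round(100 * n / n_present, 1) for value, n in counts.items()}
print(shares)
```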
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1816249 | |
| t | 947669 | |
| r | 947669 | |
| u | 947669 | |
| f | 868580 | |
| a | 868580 | |
| l | 868580 | |
| s | 868580 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8133576 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1816249 | |
| t | 947669 | |
| r | 947669 | |
| u | 947669 | |
| f | 868580 | |
| a | 868580 | |
| l | 868580 | |
| s | 868580 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8133576 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1816249 | |
| t | 947669 | |
| r | 947669 | |
| u | 947669 | |
| f | 868580 | |
| a | 868580 | |
| l | 868580 | |
| s | 868580 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8133576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1816249 | |
| t | 947669 | |
| r | 947669 | |
| u | 947669 | |
| f | 868580 | |
| a | 868580 | |
| l | 868580 | |
| s | 868580 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926392 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| - | 4 | |
| 2 | 3 | |
| b | 3 | |
| 4 | 3 | |
| 8 | 2 | 5.6% |
| 3 | 2 | 5.6% |
| 5 | 2 | 5.6% |
| 9 | 2 | 5.6% |
| Other values (6) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18 | |
| Lowercase Letter | 14 | |
| Dash Punctuation | 4 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 4 | 3 | |
| 8 | 2 | |
| 3 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 1 | 1 | 5.6% |
| 7 | 1 | 5.6% |
| 0 | 1 | 5.6% |
| 6 | 1 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 4 | |
| 2 | 3 | |
| 4 | 3 | |
| 8 | 2 | |
| 3 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 1 | 1 | 4.5% |
| 7 | 1 | 4.5% |
| 0 | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| - | 4 | |
| 2 | 3 | |
| b | 3 | |
| 4 | 3 | |
| 8 | 2 | 5.6% |
| 3 | 2 | 5.6% |
| 5 | 2 | 5.6% |
| 9 | 2 | 5.6% |
| Other values (6) | 7 |
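The category counts in the tables above (Decimal Number, Lowercase Letter, Dash Punctuation) correspond to Unicode general categories. As a minimal sketch, assuming the profiler classifies each character by its general category, the counts for the single sample value can be reproduced with Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, not part of the report or of the profiling tool.

```python
import unicodedata
from collections import Counter

# Illustrative mapping from Unicode general-category codes to the labels
# used in the report; codes not listed here are reported as-is.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The one non-missing value listed in the Sample table above.
counts = category_counts(["821cc27a-e3bb-4bc5-ac34-89ada245069d"])
total = sum(counts.values())
```

For this value the sketch gives 18 decimal digits, 14 lowercase letters and 4 dashes, 36 characters in total, which matches the tables above.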
projectId
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926390 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 10 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | lageniformis |
|---|---|
| 2nd row | US |
| 3rd row | labiatopapillosa |
| Value | Count | Frequency (%) |
| lageniformis | 1 | 33.3% |
| us | 1 | 33.3% |
| labiatopapillosa | 1 | 33.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5 | |
| l | 4 | |
| i | 4 | |
| o | 3 | |
| s | 2 | 6.7% |
| p | 2 | 6.7% |
| g | 1 | 3.3% |
| e | 1 | 3.3% |
| n | 1 | 3.3% |
| f | 1 | 3.3% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28 | |
| Uppercase Letter | 2 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| l | 4 | |
| i | 4 | |
| o | 3 | |
| s | 2 | 7.1% |
| p | 2 | 7.1% |
| g | 1 | 3.6% |
| e | 1 | 3.6% |
| n | 1 | 3.6% |
| f | 1 | 3.6% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | |
| l | 4 | |
| i | 4 | |
| o | 3 | |
| s | 2 | 6.7% |
| p | 2 | 6.7% |
| g | 1 | 3.3% |
| e | 1 | 3.3% |
| n | 1 | 3.3% |
| f | 1 | 3.3% |
| Other values (6) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5 | |
| l | 4 | |
| i | 4 | |
| o | 3 | |
| s | 2 | 6.7% |
| p | 2 | 6.7% |
| g | 1 | 3.3% |
| e | 1 | 3.3% |
| n | 1 | 3.3% |
| f | 1 | 3.3% |
| Other values (6) | 6 |
isSequenced
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 5 |
| Mean length | 4.997351001 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 1921265 | 99.7% |
| true | 5122 | 0.3% |
| 2024-12-02t13:57:24.316z | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1921265 | |
| l | 1921265 | |
| s | 1921265 | |
| a | 1921265 | |
| t | 5122 | 0.1% |
| r | 5122 | 0.1% |
| u | 5122 | 0.1% |
| 2 | 5 | < 0.1% |
| 1 | 3 | < 0.1% |
| Other values (11) | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9626813 | |
| Decimal Number | 17 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1921265 | |
| l | 1921265 | |
| s | 1921265 | |
| a | 1921265 | |
| t | 5122 | 0.1% |
| r | 5122 | 0.1% |
| u | 5122 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5 | |
| 1 | 3 | |
| 3 | 2 | 11.8% |
| 4 | 2 | 11.8% |
| 0 | 2 | 11.8% |
| 5 | 1 | 5.9% |
| 7 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9626815 | |
| Common | 22 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 5 | |
| 1 | 3 | |
| : | 2 | 9.1% |
| 3 | 2 | 9.1% |
| 4 | 2 | 9.1% |
| - | 2 | 9.1% |
| 0 | 2 | 9.1% |
| 5 | 1 | 4.5% |
| 7 | 1 | 4.5% |
| . | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1921265 | |
| l | 1921265 | |
| s | 1921265 | |
| a | 1921265 | |
| t | 5122 | 0.1% |
| r | 5122 | 0.1% |
| u | 5122 | 0.1% |
| T | 1 | < 0.1% |
| Z | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9626837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1926387 | |
| f | 1921265 | |
| l | 1921265 | |
| s | 1921265 | |
| a | 1921265 | |
| t | 5122 | 0.1% |
| r | 5122 | 0.1% |
| u | 5122 | 0.1% |
| 2 | 5 | < 0.1% |
| 1 | 3 | < 0.1% |
| Other values (11) | 16 | < 0.1% |
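The value table for isSequenced lists one entry, a timestamp, that is neither `true` nor `false`, which suggests a shifted or misaligned record. A minimal sketch for isolating such values follows; the `non_boolean_values` helper and the small `sample` list are hypothetical and only illustrate the check, they are not taken from the dataset export itself.

```python
def non_boolean_values(values):
    """Return distinct non-missing values that are not 'true'/'false'.

    Comparison is case-insensitive and ignores surrounding whitespace;
    None stands in for a missing cell.
    """
    allowed = {"true", "false"}
    return sorted(
        {v for v in values if v is not None and v.strip().lower() not in allowed}
    )


# Hypothetical sample mimicking the column: mostly booleans, one missing
# cell, and one stray timestamp like the one reported above.
sample = ["false", "true", None, "2024-12-02T13:57:24.316Z", "false"]
unexpected = non_boolean_values(sample)
```

Running the same check over the full column would be expected to surface only the single timestamp entry, assuming the counts in the report are accurate.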
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 115678 |
| Missing (%) | 6.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.88896817 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | LATIN_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | ASIA |
| Value | Count | Frequency (%) |
| north_america | 900416 | 49.7% |
| latin_america | 368762 | 20.4% |
| asia | 206888 | 11.4% |
| oceania | 167374 | 9.2% |
| africa | 56930 | 3.1% |
| europe | 56674 | 3.1% |
| antarctica | 53671 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3930515 | |
| R | 2336869 | |
| I | 2122803 | |
| C | 1600824 | |
| E | 1549900 | 7.9% |
| N | 1490223 | 7.6% |
| T | 1376520 | 7.0% |
| _ | 1269178 | 6.4% |
| M | 1269178 | 6.4% |
| O | 1124464 | 5.7% |
| Other values (6) | 1646344 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 18447640 | |
| Connector Punctuation | 1269178 | 6.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3930515 | |
| R | 2336869 | |
| I | 2122803 | |
| C | 1600824 | |
| E | 1549900 | 8.4% |
| N | 1490223 | 8.1% |
| T | 1376520 | 7.5% |
| M | 1269178 | 6.9% |
| O | 1124464 | 6.1% |
| H | 900416 | 4.9% |
| Other values (5) | 745928 | 4.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1269178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18447640 | |
| Common | 1269178 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 3930515 | |
| R | 2336869 | |
| I | 2122803 | |
| C | 1600824 | |
| E | 1549900 | 8.4% |
| N | 1490223 | 8.1% |
| T | 1376520 | 7.5% |
| M | 1269178 | 6.9% |
| O | 1124464 | 6.1% |
| H | 900416 | 4.9% |
| Other values (5) | 745928 | 4.0% |
Common
| Value | Count | Frequency (%) |
| _ | 1269178 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19716818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 3930515 | |
| R | 2336869 | |
| I | 2122803 | |
| C | 1600824 | |
| E | 1549900 | 7.9% |
| N | 1490223 | 7.6% |
| T | 1376520 | 7.0% |
| _ | 1269178 | 6.4% |
| M | 1269178 | 6.4% |
| O | 1124464 | 5.7% |
| Other values (6) | 1646344 |
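Several Frequency (%) cells in the gbifRegion tables are blank in this export. As a hedged sketch, assuming each percentage is the value's count divided by the number of non-missing records (which is consistent with the percentages that are shown), the column can be recomputed from the counts in the value table. The `frequency_percent` helper is an illustrative name, not part of the profiling tool.

```python
def frequency_percent(counts):
    """Return each value's share of the non-missing total, in percent."""
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


# Counts copied from the gbifRegion value table above.
gbif_region_counts = {
    "north_america": 900416,
    "latin_america": 368762,
    "asia": 206888,
    "oceania": 167374,
    "africa": 56930,
    "europe": 56674,
    "antarctica": 53671,
}

non_missing = sum(gbif_region_counts.values())
freq = frequency_percent(gbif_region_counts)
```

The counts sum to 1810715, which equals the 1926393 observations minus the 115678 missing values, so the assumed denominator agrees with the summary table.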
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.99998962 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 1926387 | > 99.9% |
| species | 2 | < 0.1% |
| genus | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 3852774 | |
| A | 3852774 | |
| E | 1926392 | |
| I | 1926389 | |
| C | 1926389 | |
| N | 1926388 | |
| O | 1926387 | |
| T | 1926387 | |
| H | 1926387 | |
| _ | 1926387 | |
| Other values (5) | 1926396 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 23116663 | |
| Connector Punctuation | 1926387 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3852774 | |
| A | 3852774 | |
| E | 1926392 | |
| I | 1926389 | |
| C | 1926389 | |
| N | 1926388 | |
| O | 1926387 | |
| T | 1926387 | |
| H | 1926387 | |
| M | 1926387 | |
| Other values (4) | 9 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1926387 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23116663 | |
| Common | 1926387 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 3852774 | |
| A | 3852774 | |
| E | 1926392 | |
| I | 1926389 | |
| C | 1926389 | |
| N | 1926388 | |
| O | 1926387 | |
| T | 1926387 | |
| H | 1926387 | |
| M | 1926387 | |
| Other values (4) | 9 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 1926387 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25043050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 3852774 | |
| A | 3852774 | |
| E | 1926392 | |
| I | 1926389 | |
| C | 1926389 | |
| N | 1926388 | |
| O | 1926387 | |
| T | 1926387 | |
| H | 1926387 | |
| _ | 1926387 | |
| Other values (5) | 1926396 |
level0Gid
Text
Missing 
| Distinct | 226 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1691070 |
| Missing (%) | 87.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 18 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | PAN |
| 3rd row | USA |
| 4th row | USA |
| 5th row | PAN |
| Value | Count | Frequency (%) |
| usa | 138756 | 59.0% |
| pan | 11701 | 5.0% |
| jpn | 8794 | 3.7% |
| mex | 4690 | 2.0% |
| phl | 4467 | 1.9% |
| can | 4382 | 1.9% |
| dom | 3446 | 1.5% |
| cri | 3146 | 1.3% |
| mdg | 2984 | 1.3% |
| pri | 2846 | 1.2% |
| Other values (216) | 50111 | 21.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 169536 | |
| U | 149988 | |
| S | 147144 | |
| N | 36249 | 5.1% |
| P | 32498 | 4.6% |
| M | 17314 | 2.5% |
| C | 16563 | 2.3% |
| R | 16511 | 2.3% |
| I | 11617 | 1.6% |
| J | 11408 | 1.6% |
| Other values (18) | 97141 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 705967 | |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 169536 | |
| U | 149988 | |
| S | 147144 | |
| N | 36249 | 5.1% |
| P | 32498 | 4.6% |
| M | 17314 | 2.5% |
| C | 16563 | 2.3% |
| R | 16511 | 2.3% |
| I | 11617 | 1.6% |
| J | 11408 | 1.6% |
| Other values (16) | 97139 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 6 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 705967 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 169536 | |
| U | 149988 | |
| S | 147144 | |
| N | 36249 | 5.1% |
| P | 32498 | 4.6% |
| M | 17314 | 2.5% |
| C | 16563 | 2.3% |
| R | 16511 | 2.3% |
| I | 11617 | 1.6% |
| J | 11408 | 1.6% |
| Other values (16) | 97139 |
Common
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 6 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 705969 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 169536 | |
| U | 149988 | |
| S | 147144 | |
| N | 36249 | 5.1% |
| P | 32498 | 4.6% |
| M | 17314 | 2.5% |
| C | 16563 | 2.3% |
| R | 16511 | 2.3% |
| I | 11617 | 1.6% |
| J | 11408 | 1.6% |
| Other values (18) | 97141 |
level0Name
Text
Missing 
| Distinct | 226 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1691070 |
| Missing (%) | 87.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 11.1625043 |
| Min length | 4 |
Unique
| Unique | 18 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Panama |
| 3rd row | United States |
| 4th row | United States |
| 5th row | Panama |
| Value | Count | Frequency (%) |
| united | 139310 | |
| states | 138840 | |
| panama | 11701 | 2.9% |
| japan | 8794 | 2.2% |
| méxico | 4690 | 1.2% |
| philippines | 4467 | 1.1% |
| canada | 4382 | 1.1% |
| republic | 3662 | 0.9% |
| dominican | 3446 | 0.9% |
| costa | 3146 | 0.8% |
| Other values (265) | 77445 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 437054 | |
| e | 318254 | |
| a | 296155 | |
| i | 217274 | |
| n | 208865 | |
| s | 170178 | 6.5% |
| (space) | 164560 | 6.3% |
| d | 159630 | 6.1% |
| S | 144075 | 5.5% |
| U | 140430 | 5.3% |
| Other values (52) | 370319 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2060448 | |
| Uppercase Letter | 398595 | 15.2% |
| Space Separator | 164560 | 6.3% |
| Other Punctuation | 3038 | 0.1% |
| Close Punctuation | 67 | < 0.1% |
| Open Punctuation | 67 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 437054 | |
| e | 318254 | |
| a | 296155 | |
| i | 217274 | |
| n | 208865 | |
| s | 170178 | 8.3% |
| d | 159630 | 7.7% |
| o | 34362 | 1.7% |
| c | 31190 | 1.5% |
| r | 29949 | 1.5% |
| Other values (21) | 157537 | 7.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 144075 | |
| U | 140430 | |
| P | 22372 | 5.6% |
| C | 16711 | 4.2% |
| J | 10574 | 2.7% |
| R | 10283 | 2.6% |
| M | 9896 | 2.5% |
| A | 6942 | 1.7% |
| B | 6541 | 1.6% |
| T | 6469 | 1.6% |
| Other values (14) | 24302 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1616 | |
| , | 1418 | |
| ' | 4 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 164560 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 67 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 67 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2459043 | |
| Common | 167751 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 437054 | |
| e | 318254 | |
| a | 296155 | |
| i | 217274 | |
| n | 208865 | |
| s | 170178 | 6.9% |
| d | 159630 | 6.5% |
| S | 144075 | 5.9% |
| U | 140430 | 5.7% |
| o | 34362 | 1.4% |
| Other values (45) | 332766 |
Common
| Value | Count | Frequency (%) |
| (space) | 164560 | |
| . | 1616 | 1.0% |
| , | 1418 | 0.8% |
| ) | 67 | < 0.1% |
| ( | 67 | < 0.1% |
| - | 19 | < 0.1% |
| ' | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2620383 | |
| None | 6411 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 437054 | |
| e | 318254 | |
| a | 296155 | |
| i | 217274 | |
| n | 208865 | |
| s | 170178 | 6.5% |
| (space) | 164560 | 6.3% |
| d | 159630 | 6.1% |
| S | 144075 | 5.5% |
| U | 140430 | 5.4% |
| Other values (47) | 363908 |
None
| Value | Count | Frequency (%) |
| é | 4700 | |
| ç | 1697 | 26.5% |
| ã | 5 | 0.1% |
| í | 5 | 0.1% |
| ô | 4 | 0.1% |
level1Gid
Text
Missing 
| Distinct | 1804 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1694638 |
| Missing (%) | 88.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.672701776 |
| Min length | 6 |
Unique
| Unique | 305 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | USA.10_1 |
|---|---|
| 2nd row | PAN.4_1 |
| 3rd row | USA.14_1 |
| 4th row | USA.16_1 |
| 5th row | PAN.12_1 |
| Value | Count | Frequency (%) |
| usa.10_1 | 18116 | 7.8% |
| usa.5_1 | 8182 | 3.5% |
| usa.43_1 | 8000 | 3.5% |
| pan.4_1 | 7933 | 3.4% |
| jpn.32_1 | 6827 | 2.9% |
| usa.47_1 | 6423 | 2.8% |
| usa.21_1 | 5755 | 2.5% |
| usa.44_1 | 5753 | 2.5% |
| usa.11_1 | 5094 | 2.2% |
| usa.9_1 | 4888 | 2.1% |
| Other values (1794) | 154784 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 319208 | |
| _ | 231589 | |
| . | 231553 | |
| A | 166849 | |
| U | 148282 | |
| S | 146784 | |
| 2 | 66976 | 3.8% |
| 4 | 61992 | 3.5% |
| 3 | 50877 | 2.9% |
| N | 36177 | 2.0% |
| Other values (28) | 317900 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 695761 | |
| Decimal Number | 619284 | |
| Connector Punctuation | 231589 | 13.0% |
| Other Punctuation | 231553 | 13.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 166849 | |
| U | 148282 | |
| S | 146784 | |
| N | 36177 | 5.2% |
| P | 32438 | 4.7% |
| M | 17257 | 2.5% |
| R | 16460 | 2.4% |
| C | 14782 | 2.1% |
| I | 11490 | 1.7% |
| J | 11408 | 1.6% |
| Other values (16) | 93834 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 319208 | |
| 2 | 66976 | 10.8% |
| 4 | 61992 | 10.0% |
| 3 | 50877 | 8.2% |
| 5 | 30137 | 4.9% |
| 0 | 26692 | 4.3% |
| 9 | 18370 | 3.0% |
| 7 | 17007 | 2.7% |
| 6 | 15063 | 2.4% |
| 8 | 12962 | 2.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 231589 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 231553 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1082426 | |
| Latin | 695761 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 166849 | |
| U | 148282 | |
| S | 146784 | |
| N | 36177 | 5.2% |
| P | 32438 | 4.7% |
| M | 17257 | 2.5% |
| R | 16460 | 2.4% |
| C | 14782 | 2.1% |
| I | 11490 | 1.7% |
| J | 11408 | 1.6% |
| Other values (16) | 93834 |
Common
| Value | Count | Frequency (%) |
| 1 | 319208 | |
| _ | 231589 | |
| . | 231553 | |
| 2 | 66976 | 6.2% |
| 4 | 61992 | 5.7% |
| 3 | 50877 | 4.7% |
| 5 | 30137 | 2.8% |
| 0 | 26692 | 2.5% |
| 9 | 18370 | 1.7% |
| 7 | 17007 | 1.6% |
| Other values (2) | 28025 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1778187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 319208 | |
| _ | 231589 | |
| . | 231553 | |
| A | 166849 | |
| U | 148282 | |
| S | 146784 | |
| 2 | 66976 | 3.8% |
| 4 | 61992 | 3.5% |
| 3 | 50877 | 2.9% |
| N | 36177 | 2.0% |
| Other values (28) | 317900 |
level1Name
Text
Missing 
| Distinct | 1737 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 1694634 |
| Missing (%) | 88.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 30 |
| Mean length | 8.96956321 |
| Min length | 3 |
Unique
| Unique | 296 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Colón |
| 3rd row | Illinois |
| 4th row | Iowa |
| 5th row | Panamá |
| Value | Count | Frequency (%) |
| florida | 18120 | 6.1% |
| california | 9283 | 3.1% |
| carolina | 8221 | 2.8% |
| tennessee | 8000 | 2.7% |
| colón | 7933 | 2.7% |
| virginia | 7606 | 2.6% |
| okinawa | 6827 | 2.3% |
| new | 5902 | 2.0% |
| maryland | 5759 | 1.9% |
| texas | 5753 | 1.9% |
| Other values (1876) | 212777 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 294071 | |
| i | 195344 | 9.4% |
| n | 167961 | 8.1% |
| o | 145084 | 7.0% |
| r | 120163 | 5.8% |
| s | 119088 | 5.7% |
| e | 117144 | 5.6% |
| l | 98274 | 4.7% |
| t | 78261 | 3.8% |
| (space) | 64422 | 3.1% |
| Other values (105) | 678965 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1721765 | |
| Uppercase Letter | 288487 | 13.9% |
| Space Separator | 64422 | 3.1% |
| Dash Punctuation | 2995 | 0.1% |
| Other Punctuation | 1067 | 0.1% |
| Modifier Symbol | 28 | < 0.1% |
| Connector Punctuation | 5 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 294071 | |
| i | 195344 | |
| n | 167961 | |
| o | 145084 | |
| r | 120163 | 7.0% |
| s | 119088 | 6.9% |
| e | 117144 | 6.8% |
| l | 98274 | 5.7% |
| t | 78261 | 4.5% |
| u | 50903 | 3.0% |
| Other values (60) | 335472 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 41271 | |
| M | 28854 | 10.0% |
| T | 22649 | 7.9% |
| A | 21731 | 7.5% |
| S | 20838 | 7.2% |
| F | 20114 | 7.0% |
| N | 18874 | 6.5% |
| O | 15937 | 5.5% |
| V | 10384 | 3.6% |
| I | 9849 | 3.4% |
| Other values (24) | 77986 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 910 | |
| / | 61 | 5.7% |
| . | 55 | 5.2% |
| ! | 25 | 2.3% |
| , | 16 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 64422 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2995 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 28 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2010252 | |
| Common | 68525 | 3.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 294071 | |
| i | 195344 | 9.7% |
| n | 167961 | 8.4% |
| o | 145084 | 7.2% |
| r | 120163 | 6.0% |
| s | 119088 | 5.9% |
| e | 117144 | 5.8% |
| l | 98274 | 4.9% |
| t | 78261 | 3.9% |
| u | 50903 | 2.5% |
| Other values (94) | 623959 |
Common
| Value | Count | Frequency (%) |
| (space) | 64422 | |
| - | 2995 | 4.4% |
| ' | 910 | 1.3% |
| / | 61 | 0.1% |
| . | 55 | 0.1% |
| ` | 28 | < 0.1% |
| ! | 25 | < 0.1% |
| , | 16 | < 0.1% |
| _ | 5 | < 0.1% |
| ] | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2053783 | |
| None | 24840 | 1.2% |
| Latin Ext Additional | 154 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 294071 | |
| i | 195344 | 9.5% |
| n | 167961 | 8.2% |
| o | 145084 | 7.1% |
| r | 120163 | 5.9% |
| s | 119088 | 5.8% |
| e | 117144 | 5.7% |
| l | 98274 | 4.8% |
| t | 78261 | 3.8% |
| (space) | 64422 | 3.1% |
| Other values (53) | 653971 |
None
| Value | Count | Frequency (%) |
| ó | 10786 | |
| í | 4492 | |
| á | 4303 | 17.3% |
| é | 1522 | 6.1% |
| Î | 1159 | 4.7% |
| ü | 851 | 3.4% |
| ã | 420 | 1.7% |
| ö | 314 | 1.3% |
| à | 159 | 0.6% |
| ñ | 150 | 0.6% |
| Other values (31) | 684 | 2.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ộ | 64 | |
| ồ | 45 | |
| ị | 13 | 8.4% |
| ầ | 11 | 7.1% |
| ệ | 8 | 5.2% |
| ả | 5 | 3.2% |
| ạ | 4 | 2.6% |
| ẵ | 1 | 0.6% |
| ắ | 1 | 0.6% |
| ằ | 1 | 0.6% |
level2Gid
Text
Missing 
| Distinct | 7611 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 1708984 |
| Missing (%) | 88.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.36195374 |
| Min length | 7 |
Unique
| Unique | 1730 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | USA.10.59_1 |
|---|---|
| 2nd row | PAN.4.2_1 |
| 3rd row | USA.14.18_1 |
| 4th row | USA.16.3_1 |
| 5th row | PAN.12.2_1 |
| Value | Count | Frequency (%) |
| jpn.32.28_1 | 6059 | 2.8% |
| usa.10.43_1 | 6013 | 2.8% |
| pan.4.2_1 | 5746 | 2.6% |
| usa.9.1_1 | 4888 | 2.2% |
| usa.10.44_1 | 4299 | 2.0% |
| usa.22.1_1 | 3251 | 1.5% |
| mdg.2.1_1 | 2723 | 1.3% |
| dom.29.3_1 | 2676 | 1.2% |
| cri.5.2_1 | 2210 | 1.0% |
| pan.4.5_1 | 2107 | 1.0% |
| Other values (7601) | 177437 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 434450 | |
| 1 | 372514 | |
| _ | 217409 | |
| A | 164597 | 7.3% |
| U | 146942 | 6.5% |
| S | 144683 | 6.4% |
| 2 | 131222 | 5.8% |
| 4 | 103045 | 4.6% |
| 3 | 93136 | 4.1% |
| 5 | 60231 | 2.7% |
| Other values (28) | 384553 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 948698 | |
| Uppercase Letter | 652225 | |
| Other Punctuation | 434450 | |
| Connector Punctuation | 217409 | 9.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 164597 | |
| U | 146942 | |
| S | 144683 | |
| N | 35543 | 5.4% |
| P | 27322 | 4.2% |
| C | 13617 | 2.1% |
| M | 13373 | 2.1% |
| R | 11680 | 1.8% |
| J | 9629 | 1.5% |
| E | 9601 | 1.5% |
| Other values (16) | 75238 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 372514 | |
| 2 | 131222 | 13.8% |
| 4 | 103045 | 10.9% |
| 3 | 93136 | 9.8% |
| 5 | 60231 | 6.3% |
| 0 | 42025 | 4.4% |
| 6 | 40348 | 4.3% |
| 7 | 37009 | 3.9% |
| 8 | 36094 | 3.8% |
| 9 | 33074 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 434450 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 217409 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1600557 | |
| Latin | 652225 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 164597 | |
| U | 146942 | |
| S | 144683 | |
| N | 35543 | 5.4% |
| P | 27322 | 4.2% |
| C | 13617 | 2.1% |
| M | 13373 | 2.1% |
| R | 11680 | 1.8% |
| J | 9629 | 1.5% |
| E | 9601 | 1.5% |
| Other values (16) | 75238 |
Common
| Value | Count | Frequency (%) |
| . | 434450 | |
| 1 | 372514 | |
| _ | 217409 | |
| 2 | 131222 | 8.2% |
| 4 | 103045 | 6.4% |
| 3 | 93136 | 5.8% |
| 5 | 60231 | 3.8% |
| 0 | 42025 | 2.6% |
| 6 | 40348 | 2.5% |
| 7 | 37009 | 2.3% |
| Other values (2) | 69168 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2252782 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 434450 | |
| 1 | 372514 | |
| _ | 217409 | |
| A | 164597 | 7.3% |
| U | 146942 | 6.5% |
| S | 144683 | 6.4% |
| 2 | 131222 | 5.8% |
| 4 | 103045 | 4.6% |
| 3 | 93136 | 4.1% |
| 5 | 60231 | 2.7% |
| Other values (28) | 384553 |
level2Name
Text
Missing 
| Distinct | 6184 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 1709049 |
| Missing (%) | 88.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.376522931 |
| Min length | 1 |
Unique
| Unique | 1557 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Seminole |
|---|---|
| 2nd row | Colón |
| 3rd row | Cumberland |
| 4th row | Allamakee |
| 5th row | Chepo |
| Value | Count | Frequency (%) |
| san | 6246 | 2.3% |
| onna | 6059 | 2.2% |
| miami-dade | 6013 | 2.2% |
| colón | 5755 | 2.1% |
| of | 5128 | 1.9% |
| columbia | 5068 | 1.8% |
| monroe | 4935 | 1.8% |
| district | 4903 | 1.8% |
| de | 3904 | 1.4% |
| barnstable | 3251 | 1.2% |
| Other values (6463) | 224555 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 214955 | 11.8% |
| n | 155743 | 8.6% |
| e | 142976 | 7.9% |
| o | 137584 | 7.6% |
| i | 116282 | 6.4% |
| r | 99257 | 5.5% |
| l | 83851 | 4.6% |
| t | 80842 | 4.4% |
| s | 71222 | 3.9% |
| (space) | 58473 | 3.2% |
| Other values (137) | 659402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1471166 | |
| Uppercase Letter | 273185 | 15.0% |
| Space Separator | 58473 | 3.2% |
| Dash Punctuation | 10521 | 0.6% |
| Other Punctuation | 5065 | 0.3% |
| Decimal Number | 1906 | 0.1% |
| Open Punctuation | 151 | < 0.1% |
| Close Punctuation | 116 | < 0.1% |
| Modifier Symbol | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 214955 | |
| n | 155743 | |
| e | 142976 | |
| o | 137584 | |
| i | 116282 | 7.9% |
| r | 99257 | 6.7% |
| l | 83851 | 5.7% |
| t | 80842 | 5.5% |
| s | 71222 | 4.8% |
| u | 53336 | 3.6% |
| Other values (75) | 315118 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 36023 | |
| M | 27375 | 10.0% |
| S | 23695 | 8.7% |
| D | 22216 | 8.1% |
| B | 17416 | 6.4% |
| P | 17006 | 6.2% |
| L | 16649 | 6.1% |
| A | 12629 | 4.6% |
| O | 11173 | 4.1% |
| G | 9385 | 3.4% |
| Other values (30) | 79618 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 963 | |
| 2 | 208 | 10.9% |
| 5 | 181 | 9.5% |
| 0 | 165 | 8.7% |
| 9 | 109 | 5.7% |
| 8 | 96 | 5.0% |
| 7 | 61 | 3.2% |
| 4 | 45 | 2.4% |
| 6 | 44 | 2.3% |
| 3 | 34 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2527 | |
| . | 2332 | |
| / | 177 | 3.5% |
| ? | 18 | 0.4% |
| , | 7 | 0.1% |
| & | 3 | 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 58473 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10521 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 151 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 116 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1744351 | |
| Common | 76236 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 214955 | 12.3% |
| n | 155743 | 8.9% |
| e | 142976 | 8.2% |
| o | 137584 | 7.9% |
| i | 116282 | 6.7% |
| r | 99257 | 5.7% |
| l | 83851 | 4.8% |
| t | 80842 | 4.6% |
| s | 71222 | 4.1% |
| u | 53336 | 3.1% |
| Other values (115) | 588303 |
Common
| Value | Count | Frequency (%) |
| (space) | 58473 | |
| - | 10521 | 13.8% |
| ' | 2527 | 3.3% |
| . | 2332 | 3.1% |
| 1 | 963 | 1.3% |
| 2 | 208 | 0.3% |
| 5 | 181 | 0.2% |
| / | 177 | 0.2% |
| 0 | 165 | 0.2% |
| ( | 151 | 0.2% |
| Other values (12) | 538 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1803112 | |
| None | 17297 | 1.0% |
| Latin Ext Additional | 178 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 214955 | 11.9% |
| n | 155743 | 8.6% |
| e | 142976 | 7.9% |
| o | 137584 | 7.6% |
| i | 116282 | 6.4% |
| r | 99257 | 5.5% |
| l | 83851 | 4.7% |
| t | 80842 | 4.5% |
| s | 71222 | 3.9% |
| (space) | 58473 | 3.2% |
| Other values (64) | 641927 |
None
| Value | Count | Frequency (%) |
| ó | 8847 | |
| á | 2728 | 15.8% |
| í | 1979 | 11.4% |
| é | 1235 | 7.1% |
| ñ | 556 | 3.2% |
| ã | 434 | 2.5% |
| ō | 364 | 2.1% |
| ú | 211 | 1.2% |
| à | 157 | 0.9% |
| Ō | 94 | 0.5% |
| Other values (46) | 692 | 4.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ế | 56 | |
| ố | 23 | |
| ầ | 19 | 10.7% |
| ờ | 18 | 10.1% |
| ả | 12 | 6.7% |
| ự | 11 | 6.2% |
| ề | 11 | 6.2% |
| ớ | 7 | 3.9% |
| ậ | 5 | 2.8% |
| ợ | 4 | 2.2% |
| Other values (7) | 12 | 6.7% |
level3Gid
Text
Missing 
| Distinct | 3021 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 1886622 |
| Missing (%) | 97.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 11 |
| Mean length | 11.67596993 |
| Min length | 5 |
Unique
| Unique | 1211 |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | PAN.4.2.6_1 |
|---|---|
| 2nd row | PAN.12.2.2_1 |
| 3rd row | MMR.4.2.6_1 |
| 4th row | PAN.12.1.4_1 |
| 5th row | CAN.9.20.18_1 |
| Value | Count | Frequency (%) |
| pan.4.2.4_1 | 3201 | 8.0% |
| mdg.2.1.5_1 | 2581 | 6.5% |
| pan.4.2.6_1 | 2281 | 5.7% |
| cri.5.2.1_1 | 2199 | 5.5% |
| pan.4.5.5_1 | 1729 | 4.3% |
| can.6.2.11_1 | 743 | 1.9% |
| pan.11.1.5_1 | 729 | 1.8% |
| phl.20.2.8_1 | 443 | 1.1% |
| phl.25.27.3_1 | 382 | 1.0% |
| pan.12.1.4_1 | 370 | 0.9% |
| Other values (3011) | 25113 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 119301 | |
| 1 | 77737 | |
| _ | 39767 | 8.6% |
| 2 | 30165 | 6.5% |
| 4 | 20568 | 4.4% |
| N | 19748 | 4.3% |
| A | 19069 | 4.1% |
| P | 17786 | 3.8% |
| 5 | 17385 | 3.7% |
| C | 11012 | 2.4% |
| Other values (34) | 91827 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 185939 | |
| Other Punctuation | 119301 | |
| Uppercase Letter | 119299 | |
| Connector Punctuation | 39767 | 8.6% |
| Lowercase Letter | 47 | < 0.1% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19748 | 16.6% |
| A | 19069 | 16.0% |
| P | 17786 | 14.9% |
| C | 11012 | 9.2% |
| H | 7837 | 6.6% |
| I | 6226 | 5.2% |
| R | 6127 | 5.1% |
| L | 5932 | 5.0% |
| D | 5171 | 4.3% |
| M | 3707 | 3.1% |
| Other values (13) | 16684 | 14.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 77737 | 41.8% |
| 2 | 30165 | 16.2% |
| 4 | 20568 | 11.1% |
| 5 | 17385 | 9.3% |
| 3 | 10783 | 5.8% |
| 6 | 8776 | 4.7% |
| 8 | 5685 | 3.1% |
| 9 | 5534 | 3.0% |
| 7 | 5339 | 2.9% |
| 0 | 3967 | 2.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13 | 27.7% |
| c | 12 | 25.5% |
| b | 9 | 19.1% |
| d | 6 | 12.8% |
| e | 4 | 8.5% |
| f | 1 | 2.1% |
| l | 1 | 2.1% |
| s | 1 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 119301 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 39767 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 345019 | 74.3% |
| Latin | 119346 | 25.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 19748 | 16.5% |
| A | 19069 | 16.0% |
| P | 17786 | 14.9% |
| C | 11012 | 9.2% |
| H | 7837 | 6.6% |
| I | 6226 | 5.2% |
| R | 6127 | 5.1% |
| L | 5932 | 5.0% |
| D | 5171 | 4.3% |
| M | 3707 | 3.1% |
| Other values (21) | 16731 | 14.0% |
Common
| Value | Count | Frequency (%) |
| . | 119301 | 34.6% |
| 1 | 77737 | 22.5% |
| _ | 39767 | 11.5% |
| 2 | 30165 | 8.7% |
| 4 | 20568 | 6.0% |
| 5 | 17385 | 5.0% |
| 3 | 10783 | 3.1% |
| 6 | 8776 | 2.5% |
| 8 | 5685 | 1.6% |
| 9 | 5534 | 1.6% |
| Other values (3) | 9318 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 464365 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 119301 | 25.7% |
| 1 | 77737 | 16.7% |
| _ | 39767 | 8.6% |
| 2 | 30165 | 6.5% |
| 4 | 20568 | 4.4% |
| N | 19748 | 4.3% |
| A | 19069 | 4.1% |
| P | 17786 | 3.8% |
| 5 | 17385 | 3.7% |
| C | 11012 | 2.4% |
| Other values (34) | 91827 | 19.8% |
level3Name
Text
Missing 
| Distinct | 2871 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 1887342 |
| Missing (%) | 98.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 9.371411744 |
| Min length | 2 |
Unique
| Unique | 1139 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | Cristóbal |
|---|---|
| 2nd row | Chepillo |
| 3rd row | Myitkyina |
| 4th row | Pedro González |
| 5th row | Kenora, Unorganized |
| Value | Count | Frequency (%) |
| cativá | 3201 | 5.8% |
| nosibe | 2581 | 4.7% |
| cristóbal | 2281 | 4.1% |
| limon | 2199 | 4.0% |
| portobelo | 1729 | 3.1% |
| harbour | 745 | 1.4% |
| sachs | 743 | 1.3% |
| veracruz | 729 | 1.3% |
| santa | 615 | 1.1% |
| unorganized | 585 | 1.1% |
| Other values (3192) | 39692 | 72.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 46333 | 12.7% |
| o | 27885 | 7.6% |
| i | 25003 | 6.8% |
| n | 22399 | 6.1% |
| r | 19868 | 5.4% |
| e | 19376 | 5.3% |
| t | 17849 | 4.9% |
|   | 16049 | 4.4% |
| l | 15536 | 4.2% |
| s | 13794 | 3.8% |
| Other values (115) | 141871 | 38.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 285283 | 78.0% |
| Uppercase Letter | 53616 | 14.7% |
| Space Separator | 16049 | 4.4% |
| Other Punctuation | 3460 | 0.9% |
| Decimal Number | 3386 | 0.9% |
| Open Punctuation | 1574 | 0.4% |
| Dash Punctuation | 1339 | 0.4% |
| Close Punctuation | 1256 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 46333 | 16.2% |
| o | 27885 | 9.8% |
| i | 25003 | 8.8% |
| n | 22399 | 7.9% |
| r | 19868 | 7.0% |
| e | 19376 | 6.8% |
| t | 17849 | 6.3% |
| l | 15536 | 5.4% |
| s | 13794 | 4.8% |
| b | 11070 | 3.9% |
| Other values (62) | 66170 | 23.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 9110 | 17.0% |
| N | 4814 | 9.0% |
| P | 4571 | 8.5% |
| L | 4478 | 8.4% |
| S | 4471 | 8.3% |
| B | 4332 | 8.1% |
| T | 2737 | 5.1% |
| M | 2485 | 4.6% |
| A | 2379 | 4.4% |
| H | 1680 | 3.1% |
| Other values (23) | 12559 | 23.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 842 | 24.9% |
| 5 | 814 | 24.0% |
| 2 | 601 | 17.7% |
| 3 | 231 | 6.8% |
| 0 | 194 | 5.7% |
| 4 | 191 | 5.6% |
| 6 | 160 | 4.7% |
| 7 | 129 | 3.8% |
| 8 | 116 | 3.4% |
| 9 | 108 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2306 | 66.6% |
| , | 1029 | 29.7% |
| ' | 100 | 2.9% |
| / | 22 | 0.6% |
| ! | 2 | 0.1% |
| * | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
|   | 16049 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1574 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1339 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1256 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 338899 | 92.6% |
| Common | 27064 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 46333 | 13.7% |
| o | 27885 | 8.2% |
| i | 25003 | 7.4% |
| n | 22399 | 6.6% |
| r | 19868 | 5.9% |
| e | 19376 | 5.7% |
| t | 17849 | 5.3% |
| l | 15536 | 4.6% |
| s | 13794 | 4.1% |
| b | 11070 | 3.3% |
| Other values (95) | 119786 | 35.3% |
Common
| Value | Count | Frequency (%) |
|   | 16049 | 59.3% |
| . | 2306 | 8.5% |
| ( | 1574 | 5.8% |
| - | 1339 | 4.9% |
| ) | 1256 | 4.6% |
| , | 1029 | 3.8% |
| 1 | 842 | 3.1% |
| 5 | 814 | 3.0% |
| 2 | 601 | 2.2% |
| 3 | 231 | 0.9% |
| Other values (10) | 1023 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 357168 | 97.6% |
| None | 8669 | 2.4% |
| Latin Ext Additional | 126 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 46333 | 13.0% |
| o | 27885 | 7.8% |
| i | 25003 | 7.0% |
| n | 22399 | 6.3% |
| r | 19868 | 5.6% |
| e | 19376 | 5.4% |
| t | 17849 | 5.0% |
|   | 16049 | 4.5% |
| l | 15536 | 4.3% |
| s | 13794 | 3.9% |
| Other values (62) | 133076 | 37.3% |
None
| Value | Count | Frequency (%) |
| á | 4090 | 47.2% |
| ó | 2564 | 29.6% |
| í | 832 | 9.6% |
| é | 289 | 3.3% |
| ñ | 187 | 2.2% |
| à | 157 | 1.8% |
| è | 103 | 1.2% |
| â | 99 | 1.1% |
| Đ | 63 | 0.7% |
| ú | 41 | 0.5% |
| Other values (26) | 244 | 2.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ờ | 27 | 21.4% |
| ậ | 25 | 19.8% |
| ệ | 13 | 10.3% |
| ự | 11 | 8.7% |
| ả | 9 | 7.1% |
| ẩ | 8 | 6.3% |
| ứ | 8 | 6.3% |
| ớ | 6 | 4.8% |
| ế | 5 | 4.0% |
| ạ | 3 | 2.4% |
| Other values (7) | 11 | 8.7% |
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 469562 |
| Missing (%) | 24.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 2 |
| Mean length | 2.000048736 |
| Min length | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | NE |
| 3rd row | NE |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 1307916 | 89.8% |
| lc | 117121 | 8.0% |
| dd | 11259 | 0.8% |
| nt | 6488 | 0.4% |
| vu | 6192 | 0.4% |
| cr | 3404 | 0.2% |
| en | 3150 | 0.2% |
| ex | 1118 | 0.1% |
| ew | 179 | < 0.1% |
| 2024-12-02t13:57:06.570z | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1317554 | 45.2% |
| E | 1312363 | 45.0% |
| C | 120525 | 4.1% |
| L | 117121 | 4.0% |
| D | 22518 | 0.8% |
| T | 6491 | 0.2% |
| V | 6192 | 0.2% |
| U | 6192 | 0.2% |
| R | 3404 | 0.1% |
| X | 1118 | < 0.1% |
| Other values (15) | 255 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2913660 | > 99.9% |
| Decimal Number | 58 | < 0.1% |
| Other Punctuation | 9 | < 0.1% |
| Dash Punctuation | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1317554 | 45.2% |
| E | 1312363 | 45.0% |
| C | 120525 | 4.1% |
| L | 117121 | 4.0% |
| D | 22518 | 0.8% |
| T | 6491 | 0.2% |
| V | 6192 | 0.2% |
| U | 6192 | 0.2% |
| R | 3404 | 0.1% |
| X | 1118 | < 0.1% |
| Other values (2) | 182 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 13 | 22.4% |
| 1 | 9 | 15.5% |
| 0 | 9 | 15.5% |
| 5 | 8 | 13.8% |
| 7 | 6 | 10.3% |
| 4 | 4 | 6.9% |
| 3 | 3 | 5.2% |
| 9 | 3 | 5.2% |
| 6 | 2 | 3.4% |
| 8 | 1 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 6 | 66.7% |
| . | 3 | 33.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2913660 | > 99.9% |
| Common | 73 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 13 | 17.8% |
| 1 | 9 | 12.3% |
| 0 | 9 | 12.3% |
| 5 | 8 | 11.0% |
| - | 6 | 8.2% |
| : | 6 | 8.2% |
| 7 | 6 | 8.2% |
| 4 | 4 | 5.5% |
| 3 | 3 | 4.1% |
| . | 3 | 4.1% |
| Other values (3) | 6 | 8.2% |
Latin
| Value | Count | Frequency (%) |
| N | 1317554 | 45.2% |
| E | 1312363 | 45.0% |
| C | 120525 | 4.1% |
| L | 117121 | 4.0% |
| D | 22518 | 0.8% |
| T | 6491 | 0.2% |
| V | 6192 | 0.2% |
| U | 6192 | 0.2% |
| R | 3404 | 0.1% |
| X | 1118 | < 0.1% |
| Other values (2) | 182 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2913733 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1317554 | 45.2% |
| E | 1312363 | 45.0% |
| C | 120525 | 4.1% |
| L | 117121 | 4.0% |
| D | 22518 | 0.8% |
| T | 6491 | 0.2% |
| V | 6192 | 0.2% |
| U | 6192 | 0.2% |
| R | 3404 | 0.1% |
| X | 1118 | < 0.1% |
| Other values (15) | 255 | < 0.1% |